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(57) Abstract: The present invention relates generally to the field of human genetics. Specifically, the present invention relates 
to methods and materials used to isolate and detect human diabetes mellitus predisposing gene, specifically the angiotensinogen 
(AGT) gene, some mutant alleles of which cause susceptibility to insulin-dependent diabetes mellitus (TDDM). More specifically, 
the invention relates to germline mutations in the ACT gene and their use in the diagnosis of predisposition to diabetes. The invention 
also relates to the prophylaxis and/or therapy of diabetes associated with a mutation in the AGT gene. The invention further relates 
to the screening of drugs for diabetes therapy. Finally, the invention relates to the screening of the AGT gene for mutations, which 
are useful for diagnosing the predisposition to diabetes. 
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5 TITLE OF THE INVENTION 
DIABETES GENE 



BACKGROUND OF THE INVENTION 
10 The present invention relates generally to the field of human genetics. Specifically, the 

present invention relates to methods and materials used to isolate and detect human diabetes 
mellitus predisposing gene, specifically the angiotensinogen (AGT) gene, some mutant alleles of 
which cause susceptibility to insulin-dependent diabetes melitus (IDDM). More specifically, the 
invention relates to germline mutations in the AGT gene and their use in the diagnosis of 
15 predisposition to IDDM. The invention also relates to the prophylaxis and/or therapy of diabetes 
associated with a mutation in the AGT gene. The invention further relates to the screening of 
drugs for diabetes therapy. Finally, the invention relates to the screening of the AGT gene for 
mutations, which are useful for diagnosing the predisposition to diabetes. 

The publications and other materials used herein to illuminate the background of the 
20 invention, and in particular, cases to provide additional details respecting the practice, are 
incorporated herein by reference, and for convenience, are referenced by author and date in the 
following text and respectively grouped in the appended List of References. 

Diabetes mellitus is among the most common of all metabolic disorders, affecting up to 
11% of the population by age 70. Type I diabetes (insulin dependent diabetes mellitus or 
25 IDDM) represents about 5 to 10% of this group and is the result of a progressive autoimmune 
destruction of the pancreatic P-cells with subsequent insulin deficiency. IDDM is characterized 
by a partial or complete inability to produce insulin. Patients with IDDM would die without 
daily insulin injections to control their disease. 

Few advancements in resolving the pathogenesis of diabetes were made until the mid- 
30 1970s when evidence began to accumulate to suggest that IDDM had an autoimmune 
etiopathogenesis. It is now generally accepted that IDDM results from the chronic autoimmune 
destruction of the insulin producing pancreatic P-cells. Lymphocytes and other inflammatory 
cells have been observed within the islets of Langerhans in newly diagnosed IDDM patients and 
have been found preferentially in regenerating islets composed of P-cells rather than those of 
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other cell types. This active immunological process is associated with a variety of autoantibodies 
to P-cells cytoplasmic and membrane antigens, insulin, and insulin receptors. Although the 
general mechanism by which IDDM occurs is known, IDDM becomes clinically evident only 
after the vast majority of pancreatic p-cells have been irrevocably destroyed and the individual 

5 becomes dependent upon exogenous insulin. 

In both humans and diabetes-prone non-obese diabetic (NOD) mice, genes mapping to 
the major histocompatibility complex have been associated with susceptibility to IDDM and 
shown to be very important in the disease process (Todd, 1990). Studies of NOD mice have 
mapped at least 12 other susceptibility genes to specific chromosomal locations (Prochazaka ct 

10 al., 1987; Todd et al., 1991; De Gouyon et al., 1993; Morahan et ah, 1994; Serreze et aL, 1994; 
Cornall et al., 1991; Garchon et al., 1991). In humans, markers near the insulin/insulin-like 
growth factor loci have also been associated with IDDM (Bell et al., 1984). 

Epidemiological and molecular genetic studies indicate that only 40% of the genetic 
susceptibility is due to alleles of the MHC class II genes, suggesting additional non-MHC genes 

15 are involved (Todd et al., 1988; Buzzetti. et al., 1998). Genome-wide searches and analyses of 
specific genes have identified more at least 17 loci that contribute to the disease (Buzzetti et al., 
1998; Todd, 1995; Concannon et al., 1998; Hashimoto et al., 1994; Davies et al., 1994; Mein et 
al., 1998; Verge et al., 1998). Significant evidence for linkage was reported for a ~7cM region 
on chromosome lq42-43 (Concannon et al., 1998). This region of chromosome 1 contains the 

20 angiotensinogen (AGT) gene. 

Non-insulin dependent diabetes mellitus (NIDDM) is one of the most common inherited 
diseases in man with an estimated prevalence in Caucasian populations of 8-10% (Harris et al., 
1987). It is estimated to affect more than 100 million people worldwide (King and Zimmer, 
1988; Harris et al., 1992). Clinically, NIDDM is a heterogenous disorder characterized by 

25 chronic hyperglycemia leading to progressive micro- and macrovascular lesions in the 
cardiovascular, renal and visual systems as well as diabetic neuropathy. The causes of the fasting 
hyperglycemia and/or glucose intolerance associated with this form of diabetes are not well 
understood. Unfortunately, the disease is associated with early morbidity and mortality. 
Subtypes of the disease can be identified based at least to some degree on the time of onset of 

30 the symptoms. The principal type of NIDDM occurs at a later time of onset, typically at midlife. 
Early-onset NIDDM or maturity-onset diabetes of the young (MODY) shares many features 
with the more common form(s) of NIDDM but onset occurs in early life. 
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Although most forms of NIDDM do not exhibit simple Mendelian inheritance, the 
contribution of heredity is well recognized (Rotter et al., 1990). The genetic basis of a few rare 
monogenic syndromes of NIDDM have been elucidated, but together, these syndromes account 
for a very small minority of cases (Taylor et al., 1992; Froguel et al., 1993; Steiner et al., 1990; 
5 Kadowaki et al., 1994). It is likely that the common forms of NIDDM are complex and 
heterogenous, and result when a pool of mutant genes, each of which contributes modestly and 
in a subtle way, interact with each other and with environmental, aging and behavioral 
influences to lead to the expression of the disease. This pool of genes may vary between 
populations and among individuals within a population, despite the illusion of a clinically 

1 0 homogenous phenotype. 

Certain loci have been linked to rare early-onset forms of Type II diabetes that is 
associated with chronic hyperglycemia and monogenic inheritance (i.e. maturity onset diabetes 
of the young (MODY) loci) (Bell et al., 1991; Froguel et al., 1992; Hattersley et al., 1992; 
Vaxillaire et al., 1995). The defects in the glucokinase (GCK) gene on human chromosome 7 

15 have been found to be responsible for the relatively rare MODY2 phenotype. (Froguel et al., 
1992). 

The genes responsible for MODY1 and MODY3 have been identified to be transcription 
factors hepatocyte nuclear factor 4 (HNF4) and hepatocyte nuclear factor 1 HNF1, repsectively. 
(Yamagata et al., 1996a; Yamagata et al. 1996b). The diabetes gene HNF4/MODY1 regulates 

20 expression of the AGT gene (Yanai et al., 1999). Mutations in HNF4/MODY1 have been 
demonstrated to cause predisposition to maturity onset diabetes of the young.(MODY) a 
genetically heterogenouse monogenic form of non-insulin-dependent diabetes mellitus 
(NIDDM) (Yamagata et al., 1996b; Lindner et al., 1997). 

Another locus has been identified for a rare early-onset form with mitochondrial 

25 inheritance (Van den Ouwenland et al., 1992). In addition, Harris et al. (1996) identified a locus 
of NIDDM 1 on chromosome 2 that appears to play a role in Mexican American diabetes. 
Further, Mahtani et al. (1996) report evidence of the existence of a gene on human chromosome 
12, NIDDM2, that causes NIDDM associated with low insulin secretion. The paper suggests that 
NIDDM2 and MODY3 represent different alleles of the same gene with severe mutations 

30 causing MODY3 and milder mutations giving rise to later-onset NIDDM characterized by low 
insulin secretion. 
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In view of the heterogeneity of genes associated with IDDM or NIDDM, it is desired to 
identify additional genes associated with IDDM or NIDDM for diagnostic and therapeutic 
purposes. 

5 SUMMARY OF THE INVENTION 

Tight linkage between DNA polymorphisms in and near the angiotensinogen (AGT) 
gene on chromosome 1 and diabetes in particular families was observed (Concannon, 1999). 
This invention provides the first evidence implicating specific mutations in the AGT gene with 
susceptibility to IDDM. 

10 Thus, in one aspect of the invention, novel nucleic acids containing additional genomic 

sequences are provided. These nucleic acids are set forth in SEQ ID NOs:l-5 and 8. 

In a second aspect of the invention, a method for detecting a susceptibility in an 
individual to insulin-dependent diabetes mellitus is provided. Thus, the present invention 
provides methods for determining whether a subject is at risk for developing diabetes due to a 

15 mutation in the AGT gene. This method relies on the fact that mutations in the AGT have been 
correlated by the inventors with the disease. It will be understood by those of skill in the art, 
given the disclosure of the invention that such mutations are associated with a susceptability to 
diabetes, that a variety of methods may be utilized to detect mutations in the AGT gene, 
including the mutations disclosed herein, which are associated with a susceptability to IDDM. 

20 The method can include detecting, in a tissue of the subject, the presence or absence of a 

polymorphism of the AGT gene. The detection of a polymorphism in the AGT gene may 
include ascertaining the existence of at least one of: a deletion of one or more nucleotides; an 
addition of one or more nucleotides, a substitution of one or more nucleotides; a gross 
chromosomal rearrangement; an alteration in the level of a messenger RNA transcript; the 

25 presence of a non-wild type splicing pattern of a messenger RNA transcript; a non-wild type 
level of an AGT protein; and/or an aberrant level of an AGT protein. 

For example, detecting the polymorphism can include (i) providing a probe/primer 
comprised of an oligonucleotide which hybridizes to a sense or antisense sequence of an AGT 
gene or naturally occurring mutants thereof, or 5' or 3' flanking sequences naturally associated 

30 with an AGT gene; (ii) contacting the probe/primer to an appropriate nucleic acid containing 
sample; and (iii) detecting, by hybridization of the probe/primer to the nucleic acid, the presence 
or absence of the polymorphism; e.g. wherein detecting the polymorphism comprises utilizing 
the probe/primer to determine the nucleotide sequence of an AGT gene and, optionally, of the 
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flanking nucleic acid sequences. For instance, the primer can be employed in a polymerase 
chain reaction (PCR), in a ligase chain reaction (LCR) or other amplification reactions known to 
a skilled artisan. In alternate embodiments, the level of an AGT protein is detected in an 
immunoassay using an antibody which is specifically immunoreactive with the AGT protein. 
5 In a third aspect of the invention, compounds that are agonists or antagonists of a normal 

(functional) AGT bioactivity and their use in preventing or treating diabetes are provided. For 
example, to ameliorate disease symptoms involving insufficient expression of an AGT gene 
and/or inadequate amount of functional AGT bioactivity in a subject, a gene therapeutic 
(comprising a gene encoding a functional AGT protein) or a protein therapeutic (comprising a 

10 functional AGT protein or fragment thereof) can be administered to the subject. Alternatively, 
agonists or antagonists of AGT function (wild-type or mutant) or an AGT receptor or a receptor 
for fragments of AGT can be administered. 

In a fourth aspect of the invention, compounds that are antagonists of a disease causing 
AGT bioactivity and their use in preventing or treating diabetes are provided. For example, to 

15 ameliorate disease symptoms involving expression of a mutant AGT gene or aberrant expression 
of a normal AGT gene in a subject, a therapeutically effective amount of an antisense, ribozyme 
or triple helix molecule to reduce or prevent gene expression may be administered to the subject. 
Alternatively, to ameliorate disease symptoms involving the regulation via an AGT protein or 
AGT protein fragments of an upstream or downstream element in an AGT mediated biochemical 

20 pathway (e.g. signal transduction), a therapeutically effective amount of an agonist or antagonist 
compound (e.g. small molecule, peptide, peptidomimetic, protein or antibody) that can prevent 
normal binding of the wildtype AGT protein, can induce a therapeutic effect. 

In another aspect of the invention, assays, e.g., for screening test compounds to identify 
antagonists (e.g. inhibitors), or alternatively, agonists (e.g. potentiators), of an interaction 

25 between an AGT protein and, for example, a protein or nucleic acid that binds to the AGT 
protein or fragments of AGT are provided. An exemplary method includes the steps of (i) 
combining an AGT polypeptide or bioactive fragments thereof, an AGT target molecule (such as 
an AGT ligand or nucleic acid), and a test compound, e.g., under conditions wherein, but for the 
test compound, the AGT protein and AGT target molecule are able to interact; and (ii) detecting 

30 the formation of a complex which includes the AGT protein and the target molecule either by 
directly quantitating the complex or by measuring inductive effects of the AGT protein or 
fragments of AGT protein. A statistically significant change, such as a decrease, in the 
interaction of the AGT and AGT target molecule in the presence of a test compound (relative to 
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what is detected in the absence of the test compound) is indicative of a modulation (e.g., 
inhibition or potentiation of the interaction between the AGT protein or fragments of the AGT 
protein and the target molecule). 

In a further aspect of the present invention concerns methods for modulating the 
5 transcription of certain genes in a cell by modulating AGT bioactivity, (e.g., by potentiating or 
disrupting an AGT bioactivity). In general, whether carried out in vivo, in vitro, or in situ, the 
method comprises treating the cell with an effective amount of an AGT therapeutic (agonist or 
antagonist of an AGT bioactivity) so as to alter, relative to the cell in the absence of treatment, 
the level of transcription of certain genes. Accordingly, the method can be carried out with AGT 

10 therapeutics such as peptide and peptidomimetics or other molecules identified in the above- 
referenced drug screens which agonize or antagonize the effects of a AGT bioactivity (e.g. 
transcription) of a gene which is regulated by an AGT protein. Other AGT therapeutics include 
antisense constructs for inhibiting expression of AGT proteins, and dominant negative mutants 
of AGT proteins which competitively inhibit interactions between ligands (e,g. proteins) and 

15 nucleic acids upstream and downstream of the wild-type AGT protein. 

BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A-1C show a physical map of the AGT region. Fig. 1A: Microsatellite markers 
used in genotyping. The STS WU8298 used to screen the BAC library lies on BAC 1-30. Fig. 

20 IB: B ACs containing the AGT gene. Fig. 1 C: structure of AGT and alignment on BAC eontig. 

Figures 2A-2E show haplotypes and segregation of AGT mutations in multiplex type 1 
diabetes families. Shared haplotypes are in shaded boxes. Fig. 2 A: Cysl8Arg. Fig. 2B: C(96)T 
in alternative exon 1. Fig. 2C: T(286)C in alternative exon 1. Fig. 2D: Pro264Ser linked to a 
silent polymorphism is present on the haplotype with the asterisk in family 30. Fig. 2E: C(- 

25 8 1 1)T is on the same haplotype in family 30 as Pro264Ser. 

Figures 3A-3C show rare polymorphisms in AGT and expression of alternatively spliced 
transcripts. Fig. 1 A: Genomic organization of AGTznd position of rare polymorphisms found in 
diabetes mellitus families, Approximate positions of promoters for the alternative and published 
transcripts are indicated by large arrows. The positions and types of rare polymorphisms 

30 identified in multiplex diabetes families are indicated above the exons. Within the coding exons 
the numbers refer to the base pair position in the cDNA. The polymorphism at -81 1 is relative 
to the published transcription start site in exon I (Gaillard et al., 1989). A schematic of the AGT 
polypeptide is shown below. The positions of the amino acid substitutions caused by the 
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polymorphisms are indicated relative to the start of the mature peptide. Coding region of AGT 
(black box); signal sequence(ss) (gray box); Ang I precursor peptide to the octapeptide Ang II 
(black box); mature peptide (black box and hatched box). Greater than lOOkb of genomic DNA 
(dashed line). All Cys positions of AGT indicated in a circle. Fig. 3B: Alignment of 
5 angiotensinogen from human, gorilla, chimpanzee, rat, sheep, and mouse is shown. Cys 18 in 
bold; Asnl37-Cysl38-Thrl39 underlined. Fig. 3C: Alignment of additional missense 
mutations: Alal22, Pro264, Asp319, and Ala401 shown in bold. 

DETAILED DESCRIPTION OF THE INVENTION 
10 The present invention is based in part on the discovery of polymorphisms in the AGT 

gene which are linked to insulin-dependent diabetes mellitus. Based on this finding the 
invention provides therapeutic methods, compositions and diagnostics for diabetes based on 
AGT. 

The cDNA for the AGT gene and the protein sequence are set forth in GenBank 

15 accession numbers K02215 and AH002594. The latter accession shows some genomic 
sequences (about 40-60 nucleotides) surrounding the 5 exons of the AGT gene. The AGT gene 
comprises 5 exons which are shown in SEQ ID NOS:l-5, along with genomic intron or 
untranslated sequences. SEQ ID NOs: 1-5 contain additional genomic sequences not previously 
disclosed. The AGT gene encodes a protein which comprises a 33 amino acid signal peptide 

20 and a 452 amino acid mature peptide. The coding sequence for AGT compiled from the exons 
in SEQ ID NOs: 1-5 is shown in SEQ ID NO:6. The corresponding amino acid sequence for 
AGT is set forth in SEQ ID NO:7. The sequence of alternate exon 1 along with surrounding 
genomic intron or untranslated sequences is shown in SEQ ID NO:8. 

The present invention relates to AGT agonists and antagonists and their use in treating 

25 diabetes. For example, (i) nucleic acid molecules encoding functional AGT protein; (ii) nucleic 
acids that are effective antisense, ribozyme and triplex antagonists of nucleic acids encoding 
functional AGT protein; (iii) functional AGT proteins or peptides; (iv) anti-AGT antibodies; (v) 
drugs affecting wild-type or mutant AGT function or AGT interaction with an AGT receptor and 
preparations of such compositions are disclosed herein. In addition, the invention provides drug 

30 discovery assays for identifying additional agents that agonize or antagonize the biological 
function of AGT protein (e.g. by altering the interaction of AGT molecules with either 
downstream or upstream elements in the biochemical (e.g. signal transduction) pathway). 
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Moreover, the present invention provides assays for diagnosing whether a subject has or has a 
predisposition towards developing diabetes mellitus. 

Proof that any particular gene located within the genetically defined interval is a disease 
susceptability locus is obtained by finding sequences in DNA or RNA extracted from affected 

5 kindred members which create abnormal gene products or abnormal levels of gene product. 
Such disease susceptibility alleles will co-segregate with the disease in large kindreds. They 
will also be present at a much higher frequency in non-kindred individuals with the disease than 
in individuals in the general population. In identifying a disease susceptability locus, the key is 
to find polymorphisms or mutations which are serious enough to cause obvious disruption to the 

10 normal function of the gene product These mutations can take a number of forms. The most 
severe forms would be frame shift mutations or large deletions which would cause the gene to 
code for an abnormal protein or one which would significantly alter protein expression. Less 
severe disruptive mutations would include small in-frame deletions and nonconservative base 
pair substitutions which would have a significant effect on the protein produced, such as changes 

15 to or from a cysteine residue, from a basic to an acidic amino acid or vice versa, from a 
hydrophobic to hydrophilic amino acid or vice versa, or other mutations which would affect 
secondary, tertiary or quaternary protein structure. Small deletions or base pair substitutions 
could also significantly alter protein expression by changing the level of transcription, splice 
pattern, mRNA stability, or translation efficiency of the gene transcript. Silent mutations or 

20 those resulting in conservative amino acid substitutions would not generally be expected to 
disrupt protein function. Causal mutations can also be found in the promoter of the gene. These 
mutations would interfere with the binding of regulatory factors and in this way alter 
transcription of the gene and therefore change the function of the gene. 

In one aspect, the invention features probes and primers for use in a prognostic or 

25 diagnostic assay. For instance, the present invention also provides a probe/primer comprising a 
substantially purified oligonucleotide, which oligonucleotide comprises a region of nucleotide 
sequence that hybridizes under stringent conditions to at least approximately 12, preferably 25, 
more preferably 40, 50 or 75 consecutive nucleotides of sense or anti-sense sequence of AGT, 
including 5' and/or 3' untranslated regions. In preferred embodiments, the probe further 

30 comprises a label group attached thereto and able to be detected, e.g. the label group is selected 
from amongst radioisotopes, fluorescent compounds, enzymes, and enzyme co-factors. 

In a further aspect, the present invention features methods for determining whether a 
subject is at risk for developing diabetes mellitus. According to the diagnostic and prognostic 
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methods of the present invention, alteration of the wild-type AGT locus is detected. "Alteration 
of a wild-type gene" encompasses all forms of mutations including deletions, insertions and 
point mutations in the coding and noncoding regions. Deletions may be of the entire gene or of 
only a portion of the gene. Point mutations may result in stop codons, frameshift mutations or 
5 amino acid substitutions. Point mutations or deletions in the promoter can change transcription 
and thereby alter the gene function. Somatic mutations are those which occur only in certain 
tissues and are not inherited in the germline. Germline mutations can be found in any of a 
body's tissues and are inherited. The finding of AGT germline mutations thus provides 
diagnostic information. An AGT allele which is not deleted (e.g., found on the sister 
10 chromosome to a chromosome carrying an AGT deletion) can be screened for other mutations, 
such as insertions, small deletions, and point mutations. Point mutational events may occur in 
regulatory regions, such as in the promoter of the gene, or in intron regions or at intron/exon 
junctions. 

Useful diagnostic techniques include, but are not limited to fluorescent in situ 
15 hybridization (FISH), direct DNA sequencing, PFGE analysis, Southern blot analysis, single 
stranded conformation analysis (SSCA), RNase protection assay, allele-specific oligonucleotide 
(ASO), dot blot analysis and PCR-SSCP, as discussed in detail further below. Also useful is the 
recently developed technique of DNA microchip technology. In addition to the techniques 
described herein, similar and other useful techniques are also described in U.S. Patent Nos. 
20 5,837,492 and 5,800,998, each incorporated herein by reference. 

Predisposition to disease can be ascertained by testing any tissue of a human for 
mutations of the AGT gene. For example, a person who has inherited a germline AGT mutation 
would be prone to develop IDDM. This can be determined by testing DNA from any tissue of 
the person's body. Most simply, blood can be drawn and DNA extracted from the cells of the 
25 blood. In addition, prenatal diagnosis can be accomplished by testing fetal ceils, placental cells 
or amniotic cells for mutations of the AGT gene. Alteration of a wild-type AGT allele, 
whether, for example, by point mutation or deletion, can be detected by any of the means 
discussed herein. 

There are several methods that can be used to detect DNA sequence variation. Direct 
30 DNA sequencing, either manual sequencing or automated fluorescent sequencing can detect 
sequence variation. For a gene as large as AGT, manual sequencing is very labor-intensive, but 
under optimal conditions, mutations in the coding sequence of a gene are rarely missed. 
Another approach is the single-stranded conformation polymorphism assay (SSCA) (Orita et al, 
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1989). This method does not detect all sequence changes, especially if the DNA fragment size 
is greater than 200 bp, but can be optimized to detect most DNA sequence variation. The 
reduced detection sensitivity is a disadvantage, but the increased throughput possible with SSCA 
makes it an attractive, viable alternative to direct sequencing for mutation detection on a 

5 research basis. The fragments which have shifted mobility on SSCA gels are then sequenced to 
determine the exact nature of the DNA sequence variation. Other approaches based on the 
detection of mismatches between the two complementary DNA strands include clamped 
denaturing gel electrophoresis (CDGE) (Sheffield et al, 1991), heteroduplex analysis (HA) 
(White et al, 1992) and chemical mismatch cleavage (CMC) (Grompe et al, 1989). None of the 

10 methods described above will detect large deletions, duplications or insertions, nor will they 
detect a regulatory mutation which affects transcription or translation of the protein. Other 
methods which might detect these classes of mutations such as a protein truncation assay or the 
asymmetric assay, detect only specific types of mutations and would not detect missense 
mutations. A review of currently available methods of detecting DNA sequence variation can be 

15 found in a recent review by Grompe (1993). Once a mutation is known, an allele specific 
detection approach such as allele specific oligonucleotide (ASO) hybridization can be utilized to 
rapidly screen large numbers of other samples for that same mutation. 

Detection of point mutations may be accomplished by molecular cloning of the AGT 
allele(s) and sequencing the allele(s) using techniques well known in the art. Alternatively, the 

20 gene sequences can be amplified directly from a genomic DNA preparation from the tissue, 
using known techniques. The DNA sequence of the amplified sequences can then be 
determined. 

There are six well known methods for a more complete, yet still indirect, test for 
confirming the presence of a susceptibility allele: 1) single-stranded conformation analysis 

25 (SSCA) (Orita et al, 1989); 2) denaturing gradient gel electrophoresis (DGGE) (Wartell et al, 
1990; Sheffield et al, 1989); 3) RNase protection assays (Finkelstein et al, 1990; Kinszler et 
al., 1991); 4) allele-specific oligonucleotides (ASOs) (Conner et al, 1983); 5) the use of 
proteins which recognize nucleotide mismatches, such as the E. colt mutS protein (Modrich, 
1991); and 6) allele-specific PCR (Rano and Kidd, 1989). For allele-specific PCR, primers are 

30 used which hybridize at their 3' ends to a particular AGT mutation. If the particular AGT 
mutation is not present, an amplification product is not observed. Amplification Refractory 
Mutation System (ARMS) can also be used, as disclosed in European Patent Application 
Publication No. 0332435 and in Newton et al, 1989. Insertions and deletions of genes can also 
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be detected by cloning, sequencing and amplification. In addition, restriction fragment length 
polymorphism (RFLP) probes for the gene or surrounding marker genes can be used to score 
alteration of an allele or an insertion in a polymorphic fragment. Such a method is particularly 
useful for screening relatives of an affected individual for the presence of the AGT mutation 
5 found in that individual. Other techniques for detecting insertions and deletions as known in the 
art can be used. 

In the first three methods (SSCA, DGGE and RNase protection assay), a new 
electrophoretic band appears. SSCA detects a band which migrates differentially because the 
sequence change causes a difference in single-strand, intramolecular base pairing. RNase 

10 protection involves cleavage of the mutant polynucleotide into two or more smaller fragments. 
DGGE detects differences in migration rates of mutant sequences compared to wild-type 
sequences, using a denaturing gradient gel. In an allele-specific oligonucleotide assay, an 
oligonucleotide is designed which detects a specific sequence, and the assay is performed by 
detecting the presence or absence of a hybridization signal. In the mutS assay, the protein binds 

15 only to sequences that contain a nucleotide mismatch in a heteroduplex between mutant and 
wild-type sequences. 

Mismatches, according to the present invention, are hybridized nucleic acid duplexes in 
which the two strands are not 100% complementary. Lack of total homology may be due to 
deletions, insertions, inversions or substitutions. Mismatch detection can be used to detect point 

20 mutations in the gene or in its mRNA product. While these techniques are less sensitive than 
sequencing, they are simpler to perform on a large number of tumor samples. An example of a 
mismatch cleavage technique is the RNase protection method. In the practice of the present 
invention, the method involves the use of a labeled riboprobe which is complementary to the 
human wild-type AGT gene coding sequence. The riboprobe and either mRNA or DNA isolated 

25 from the tumor tissue are annealed (hybridized) together and subsequently digested with the 
enzyme RNase A which is able to detect some mismatches in a duplex RNA structure. If a 
mismatch is detected by RNase A, it cleaves at the site of the mismatch. Thus, when the 
annealed RNA preparation is separated on an electrophoretic gel matrix, if a mismatch has been 
detected and cleaved by RNase A, an RNA product will be seen which is smaller than the full 

30 length duplex RNA for the riboprobe and the mRNA or DNA. The riboprobe need not be the 
full length of the AGT mRNA or gene but can be a segment of either. If the riboprobe comprises 
only a segment of the AGT mRNA or gene, it will be desirable to use a number of these probes 
to screen the whole mRNA sequence for mismatches. 
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In similar fashion, DNA probes can be used to detect mismatches, through enzymatic or 
chemical cleavage. See, e.g., Cotton et a/., 1988; Shenk et aL 1975; Novack et al, 1986. 
Alternatively, mismatches can be detected by shifts in the electrophoretic mobility of 
mismatched duplexes relative to matched duplexes. See, e.g., Cariello, 1988. With either 

5 riboprobes or DNA probes, the cellular mRNA or DNA which might contain a mutation can be 
amplified using PCR before hybridization. Changes in DNA of the AGT gene can also be 
detected using Southern hybridization, especially if the changes are gross rearrangements, such as 
deletions and insertions. 

DNA sequences of the AGT gene which have been amplified by use of PCR may also be 

10 screened using allele-specific probes. These probes are nucleic acid oligomers, each of which 
contains a region of the AGT gene sequence harboring a known mutation. For example, one 
oligomer may be about 30 nucleotides in length (although shorter and longer oligomers are also 
usable as well recognized by those of skill in the art), corresponding to a portion of the AGT 
gene sequence. By use of a battery of such allele-specific probes, PCR amplification products 

15 can be screened to identify the presence of a previously identified mutation in the AGT gene. 
Hybridization of allele-specific probes with amplified AGT sequences can be performed, for 
example, on a nylon filter. Hybridization to a particular probe under high stringency 
hybridization conditions indicates the presence of the same mutation in the tumor tissue as in the 
allele-specific probe. 

20 The newly developed technique of nucleic acid analysis via microchip technology is also 

applicable to the present invention. In this technique, literally thousands of distinct oligonucleotide 
probes are built up in an array on a silicon chip. Nucleic acid to be analyzed is fluorescently 
labeled and hybridized to the probes on the chip. It is also possible to study nucleic acid-protein 
interactions using these nucleic acid microchips. Using this technique one can determine the 

25 presence of mutations or even sequence the nucleic acid being analyzed or one can measure 
expression levels of a gene of interest The method is one of parallel processing of many, even 
thousands, of probes at once and can tremendously increase the rate of analysis. Several papers 
have been published which use this technique. Some of these are Hacia et aL, 1996: Shoemaker et 
aL, 1996; Chee et aL, 1996; Lockhart et aL, 1996; DeRisi et aL, 1996; Lipshutz et aL, 1995. This 

30 method has already been used to screen people for mutations in the breast cancer gene BRCA1 
(Hacia et aL, 1996). This new technology has been reviewed in a news article in Chemical and 
Engineering News (Borman, 1996) and been the subject of an editorial (Nature Genetics, 1996). 
Also see Fodor (1997). 
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The most definitive test for mutations in a candidate locus is to directly compare 
genomic AGT sequences from disease patients with those from a control population. 
Alternatively, one could sequence messenger RNA after amplification, e.g., by PCR, thereby 
eliminating the necessity of determining the exon structure of the candidate gene. 
5 Mutations from disease patients falling outside the coding region of AGT can be detected 

by examining the non-coding regions, such as introns and regulatory sequences near or within 
the AGT gene. An early indication that mutations in noncoding regions are important may come 
from Northern blot experiments that reveal messenger RNA molecules of abnormal size or 
abundance in disease patients as compared to control individuals. 

10 Alteration of AGT mRNA expression can be detected by any techniques known in the 

art. These include Northern blot analysis, PCR amplification and RNase protection. Diminished 
or increased mRNA expression indicates an alteration of the wild-type AGT gene. Alteration of 
wild-type AGT genes can also be detected by screening for alteration of wild-type AGT protein. 
For example, monoclonal antibodies immunoreactive with AGT can be used to screen a tissue. 

15 Lack of cognate antigen would indicate an AGT mutation. Antibodies specific for products of 
mutant alleles could also be used to detect mutant AGT gene product. Such immunological 
assays can be done in any convenient formats known in the art. These include Western blots, 
immunohistochemical assays and ELISA assays. Any means for detecting an altered AGT 
protein can be used to detect alteration of wild-type AGT genes. Functional assays, such as 

20 protein binding determinations, can be used. In addition, assays can be used which detect AGT 
biochemical function. Finding a mutant AGT gene product indicates alteration of a wild-type 
AGT gene. 

The primer pairs of the present invention are useful for determination of the nucleotide 
sequence of a particular AGT allele using PCR. The pairs of single-stranded DNA primers can 

25 be annealed to sequences within or surrounding the AGT gene on chromosome 1 in order to 
prime amplifying DNA synthesis of the AGT gene itself. A complete set of these primers 
allows synthesis of all of the nucleotides of the AGT gene coding sequences, i.e., the exons. 
The set of primers preferably allows synthesis of both intron and exon sequences. Allele-specific 
primers can also be used. Such primers anneal only to particular AGT mutant alleles, and thus 

30 will only amplify a product in the presence of the mutant allele as a template. 

In order to facilitate subsequent cloning of amplified sequences, primers may have 
restriction enzyme site sequences appended to their 5 f ends. Thus, all nucleotides of the primers 
are derived from AGT sequences or sequences adjacent to AGT, except for the few nucleotides 
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necessary to form a restriction enzyme site. Such enzymes and sites are well known in the art. 
The primers themselves can be synthesized using techniques which are well known in the art. 
Generally, the primers can be made using oligonucleotide synthesizing machines which are 
commercially available. Given the known sequences of the AGT exons and the 5' alternate 
5 exon, the design of particular primers is well within the skill of the art. Suitable primers for 
mutation screening are also described herein. 

The nucleic acid probes provided by the present invention are useful for a number of 
purposes. They can be used in Southern hybridization to genomic DNA and in the RNase 
protection method for detecting point mutations already discussed above. The probes can be 
10 used to detect PCR amplification products. They may also be used to detect mismatches with the 
AGT gene or mRNA using other techniques. 

It has been discovered that individuals with the wild-type AGT gene do not have IDDM 
which results from the AGT allele. However, mutations which interfere with the function of the 
AGT protein are involved in the susceptability to IDDM as shown herein. Thus, the presence of 
1 5 an altered (or a mutant) AGT gene which produces a protein having a loss of function, or altered 
function, directly correlates to an increased risk of disease. In order to detect an AGT gene 
mutation, a biological sample is prepared and analyzed for a difference between the sequence of 
the AGT allele being analyzed and the sequence of the wild-type AGT allele. Mutant AGT 
alleles can be initially identified by any of the techniques described above. The mutant alleles 
20 are then sequenced to identify the specific mutation of the particular mutant allele. 
Alternatively, mutant AGT alleles can be initially identified by identifying mutant (altered) AGT 
proteins, using conventional techniques. The mutant alleles are then sequenced to identify the 
specific mutation for each allele. The mutations, especially those which lead to an altered 
function of the AGT protein, are then used for the diagnostic methods of the present invention. 
25 The present invention employs definitions commonly used in the art with specific 

reference to the gene described in the present application. Such definitions can be found in U.S. 
Patent Nos. 5,837,492 and 5,800,998, each incorporated herein by reference. Such definitions 
are employed herein unless the context indicates otherwise. 
Methods of Use: Nucleic Acid D iagnosis and Diagnostic Kits 
30 In order to detect the presence of an AGT allele predisposing an individual to diabetes, a 

biological sample such as blood is prepared and analyzed for the presence or absence of 
predisposing alleles of AGT. Results of these tests and interpretive information are returned to 
the health care provider for communication to the tested individual. Such diagnoses may be 
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performed by diagnostic laboratories, or, alternatively, diagnostic kits are manufactured and sold 
to health care providers or to private individuals for self-diagnosis. Diagnositic or prognostic 
tests can be performed as described herein or using well known techniques, such as described in 
U.S. Patent No. 5,800,998, incorporated herein by reference. 
5 Initially, the screening method involves amplification of the relevant AGT sequences. In 

another preferred embodiment of the invention, the screening method involves a non-PCR based 
strategy. Such screening methods include two-step label amplification methodologies that are 
well known in the art. Both PCR and non-PCR based screening strategies can detect target 
sequences with a high level of sensitivity. 

10 The most popular method used today is target amplification. Here, the target nucleic acid 

sequence is amplified with polymerases. One particularly preferred method using polymerase- 
driven amplification is the polymerase chain reaction (PCR). The polymerase chain reaction and 
other polymerase-driven amplification assays can achieve over a million-fold increase in copy 
number through the use of polymerase-driven amplification cycles. Once amplified, the resulting 

15 nucleic acid can be sequenced or used as a substrate for DNA probes. 

When the probes are used to detect the presence of the target sequences (for example, in 
screening for diabetes susceptibility), the biological sample to be analyzed, such as blood or 
serum, may be treated, if desired, to extract the nucleic acids. The sample nucleic acid may be 
prepared in various ways to facilitate detection of the target sequence; e.g. denaturation, 

20 restriction digestion, electrophoresis or dot blotting. The targeted region of the analyte nucleic 
acid usually must be at least partially single-stranded to form hybrids with the targeting 
sequence of the probe. If the sequence is naturally single-stranded, denaturation will not be 
required. However, if the sequence is double-stranded, the sequence will probably need to be 
denatured. Denaturation can be carried out by various techniques known in the art. 

25 Analyte nucleic acid and probe are incubated under conditions which promote stable 

hybrid formation of the target sequence in the probe with the putative targeted sequence in the 
analyte. The region of the probes which is used to bind to the analyte can be made completely 
complementary to the targeted region of human chromosome 1. Therefore, high stringency 
conditions are desirable in order to prevent false positives. However, conditions of high 

30 stringency are used only if the probes are complementary to regions of the chromosome which 
are unique in the genome. The stringency of hybridization is determined by a number of factors 
during hybridization and during the washing procedure, including temperature, ionic strength, 
base composition, probe length, and concentration of formamide. These factors are outlined in, 
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for -example, Maniatis et aL, 1982 and Sambrook €t aL, 1989. Under certain circumstances, the 
formation of higher order hybrids, such as triplexes, quadraplexes, etc., may be desired to 
provide the means of detecting target sequences. 

Detection, if any, of the resulting hybrid is usually accomplished by the use of labeled 

5 probes. Alternatively, the probe may be unlabeled, but may be detectable by specific binding 
with a ligand which is labeled, either directly or indirectly. Suitable labels, and methods for 
labeling probes and ligands are known in the art, and include, for example, radioactive labels 
which may be incorporated by known methods (e.g., nick translation, random priming or 
kinasing), biotin, fluorescent groups, chemiluminescent groups (e.g., dioxetanes, particularly 

10 triggered dioxetanes), enzymes, antibodies and the like. Variations of this basic scheme are 
known in the art, and include those variations that facilitate separation of the hybrids to be 
detected from extraneous materials and/or that amplify the signal from the labeled moiety. A 
number of these variations are reviewed in, e.g., Matthews and Kricka, 1988; Landegren et aL, 
1988; Mittlin, 1989; U.S. Patent 4,868,105, and inEPO Publication No. 225,807. 

15 As noted above, non-PCR based screening assays are also contemplated in this 

invention. This procedure hybridizes a nucleic acid probe (or an analog such as a methyl 
phosphonate backbone replacing the normal phosphodiester), to the low level DNA target. This 
probe may have an enzyme covalently linked to the probe, such that the covalent linkage does 
not interfere with the specificity of the hybridization. This enzyme-probe-conjugate-target 

20 nucleic acid complex can then be isolated away from the free probe enzyme conjugate and a 
substrate is added for enzyme detection. Enzymatic activity is observed as a change in color 
development or luminescent output resulting in a 10 3 -10 6 increase in sensitivity. For an example 
relating to the preparation of oligodeoxynucleotide-alkaline phosphatase conjugates and their 
use as hybridization probes see Jablonski et aL, 1986. 

25 Two-step label amplification methodologies are known in the art. These assays work on 

the principle that a small ligand (such as digoxigenin, biotin, or the like) is attached to a nucleic 
acid probe capable of specifically binding AGT. Allele specific probes are also contemplated 
within the scope of this example and exemplary allele specific probes include probes 
encompassing the predisposing or potentially predisposing mutations summarized in herein. 

30 In one example, the small ligand attached to the nucleic acid probe is specifically 

recognized by an antibody-enzyme conjugate. In one embodiment of this example, digoxigenin 
is attached to the nucleic acid probe. Hybridization is detected by an antibody-alkaline 
phosphatase conjugate which turns over a chemiluminescent substrate. For methods for labeling 
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nucleic acid probes according to this embodiment see Martin et ai, 1990. In a second example, 
the small ligand is recognized by a second ligand-enzyme conjugate that is capable of 
specifically complexing to the first ligand. A well known embodiment of this example is the 
biotin-avidin type of interactions. For methods for labeling nucleic acid probes and their use in 
5 biotin-avidin based assays see Rigby et ai , 1 977 and Nguyen et ai , 1 992. 

It is also contemplated within the scope of this invention that the nucleic acid probe 
assays of this invention will employ a cocktail of nucleic acid probes capable of detecting AGT. 
Thus, in one example to detect the presence of AGT in a cell sample, more than one probe 
complementary to AGT is employed and in particular the number of different probes is 

10 alternatively 2, 3, or 5 different nucleic acid probe sequences. In another example, to detect the 
presence of mutations in the AGT gene sequence in a patient, more than one probe 
complementary to AGT is employed where the cocktail includes probes capable of binding to the 
allele-specific mutations identified in populations of patients with alterations in AGT In this 
embodiment, any number of probes can be used, and will preferably include probes 

15 corresponding to the major gene mutations identified as predisposing an individual to diabetes. 
Some candidate probes contemplated within the scope of the invention include probes that 
include the allele-specific mutations identified herein and those that have the AGT regions 
corresponding to SEQ ID NOs:l-5 and 8 both 5* and 3' to the mutation site. 
Methods of Use: Peptide Diagnosis and Diagnostic Kits 

20 Susceptibility to daibetes can also be detected on the basis of the alteration of wild-type 

AGT polypeptide. Peptide diagnositic or prognostic tests can be performed as described herein 
or using well known techniques, such as described in U.S. Patent No. 5,800,998, incorporated 
herein by reference. For example, such alterations can be determined by sequence analysis in 
accordance with conventional techniques. More preferably, antibodies (polyclonal or 

25 monoclonal) are used to detect differences in, or the absence of, AGT peptides. The antibodies 
may be prepared in accordance with conventional techniques. Other techniques for raising and 
purifying antibodies are well known in the art and any such techniques may be chosen to 
achieve the preparations claimed in this invention. In a preferred embodiment of the invention, 
antibodies will immunoprecipitate AGT proteins or fragments of the AGT protein from solution 

30 as well as react with AGT peptides on Western or immunoblots of polyacrylamide gels. In 
another preferred embodiment, antibodies will detect AGT proteins and protein fragments in 
paraffin or frozen tissue sections, using immunocytochemical techniques. 
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Preferred embodiments relating to methods for detecting AGT or its mutations include 
enzyme linked immunosorbent assays (ELISA), radioimmunoassays (RIA), immunoradiometric 
assays (IRMA) and immunoenzymatic assays (IEMA), including sandwich assays using 
monoclonal and/or polyclonal antibodies. Exemplary sandwich assays are described by David et 

5 al in U.S. Patent Nos. 4,376,1 10 and 4,486,530, hereby incorporated by reference. 
Methods of Use: Drug Screening 

This invention is particularly useful for screening compounds by using a wild-type or 
mutant AGT polypeptide or binding fragment thereof in any of a variety of drug screening 
techniques. Drug screening can be performed as described herein or using well known 

10 techniques, such as described in U.S. Patent Nos. 5,800,998 and 5,891,628, each incorporated 
herein by reference. 

The AGT polypeptide or fragment employed in such a test may either be free in solution, 
affixed to a solid support, or borne on a cell surface. One method of drug screening utilizes 
eucaryotic or procaryotic host cells which are stably transformed with recombinant 

15 polynucleotides expressing the polypeptide or fragment, preferably in competitive binding 
assays. Such cells, either in viable or fixed form, can be used for standard binding assays. One 
may measure, for example, for the formation of complexes between an AGT polypeptide or 
fragment and the agent being tested, or examine the degree to which the formation of a complex 
between an AGT polypeptide or fragment and a known ligand, e.g. AGT receptor (ATI), is 

20 interfered with by the agent being tested. 

Thus, the present invention provides methods of screening for drugs comprising 
contacting such an agent with an AGT polypeptide or fragment thereof and assaying <i) for the 
presence of a complex between the agent and the AGT polypeptide or fragment, or <ii) for the 
presence of a complex between the AGT polypeptide or fragment and a ligand, by methods well 

25 known in the art. In such competitive binding assays the AGT polypeptide or fragment is 
typically labeled. Free AGT polypeptide or fragment is separated from that present in a 
proteimprotein complex, and the amount of free <i.e., uncomplexed) label is a measure of the 
binding of the agent being tested to AGT or its interference with AGTrligand binding, 
respectively. 

30 Another technique for drug screening provides high throughput screening for compounds 

having suitable binding affinity to the AGT polypeptides and is described in detail in Geysen, 
PCT published application WO 84/03564, published on September 13, 1984. Briefly stated, 
large numbers of different small peptide test compounds are synthesized on a solid substrate, 
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such as plastic pins or some other surface. The peptide test compounds are reacted with AGT 
polypeptides and washed. Bound AGT polypeptides are then detected by methods well known 
in the art. 

Purified AGT can be coated directly onto plates for use in the aforementioned drug 
5 screening techniques. However, non-neutralizing antibodies to the polypeptide can be used to 
capture antibodies to immobilize the AGT polypeptide on the solid phase. 

This invention also contemplates the use of competitive drug screening assays in which 
neutralizing antibodies capable of specifically binding the AGT polypeptide compete with a test 
compound for binding to the AGT polypeptide or fragments thereof. In this manner, the 

10 antibodies can be used to detect the presence of any peptide which shares one or more antigenic 
determinants of the AGT polypeptide. 

A further technique for drug screening involves the use of host eukaryotic cell lines or 
cells (such as described above) which express a wild-type or mutant AGT gene and as a 
consequence of expression of wild type or mutant AGT demonstrate a specific phenotype.. The 

15 phenotype of the cells is examined to determine if the compound is capable of modulating the 
phenotype and thereby AGT function. 

Briefly, a method of screening for a substance which modulates activity of a polypeptide 
may include contacting one or more test substances with the polypeptide in a suitable reaction 
medium, testing the activity of the treated polypeptide and comparing that activity with the 

20 activity of the polypeptide in comparable reaction medium untreated with the test substance or 
substances. A difference in activity between the treated and untreated polypeptides is indicative 
of a modulating effect of the relevant test substance or substances. 

Prior to or as well as being screened for modulation of activity, test substances may be 
screened for ability to interact with the polypeptide, e.g., in a yeast two-hybrid system (e.g., 

25 Bartel et al., 1993; Fields and Song, 1989; Chevray and Nathans, 1992; Lee et al., 1995). This 
system may be used as a coarse screen prior to testing a substance for actual ability to modulate 
activity of the polypeptide. Alternatively, the screen could be used to screen test substances for 
binding to an AGT specific binding partner, or to find mimetics of an AGT polypeptide. 
Methods of Use: Rational Drug Design 

30 The goal of rational drug design is to produce structural analogs of biologically active 

polypeptides of interest or of small molecules with which they interact (e.g., agonists, 
antagonists, inhibitors) in order to fashion drugs which are, for example, more active or stable 
forms of the polypeptide, or which, e.g., enhance or interfere with the function of a polypeptide 
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in vivo. See, e.g., Hodgson, 1991. Rational drug design can be performed as described herein or 
using well known techniques, such as described in U.S. Patent Nos. 5,800,998 and 5,891,628, 
each incorporated herein by reference. 

In one approach, one first determines the three-dimensional structure of a protein of 

5 interest (e.g., AGT polypeptide or fragments of the AGT polypeptide) or, for example, of the 
AGT-receptor or ligand complex, by x-ray crystallography, by computer modeling or most 
typically, by a combination of approaches. Less often, useful information regarding the structure 
of a polypeptide may be gained by modeling based on the structure of homologous proteins. An 
example of rational drug design is the development of HIV protease inhibitors (Erickson et al, 

10 1990). In addition, peptides (e.g., AGT polypeptide or fragments thereof) are analyzed by an 
alanine scan (Wells, 1991). In this technique, an amino acid residue is replaced by Ala, and its 
effect on the peptide's activity is determined. Each of the amino acid residues of the peptide is 
analyzed in this manner to determine the important regions of the peptide. 

It is also possible to isolate a target-specific antibody, selected by a functional assay, and 

15 then to solve its crystal structure. In principle, this approach yields a pharmacore upon which 
subsequent drug design can be based. It is possible to bypass protein crystallography altogether 
by generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically active 
antibody. As a mirror image of a mirror image, the binding site of the anti-ids would be 
expected to be an analog of the original receptor. The anti-id could then be used to identify and 

20 isolate peptides from banks of chemically or biologically produced banks of peptides. Selected 
peptides would then act as the pharmacore. Thus, one may design drugs which have, e.g., 
improved AGT polypeptide activity or stability or which act as inhibitors, agonists, antagonists, 
etc. of AGT polypeptide activity. 

Following identification of a substance which modulates or affects polypeptide activity, 

25 the substance may be investigated further. Furthermore, it may be manufactured and/or used in 
preparation, i.e., manufacture or formulation, or a composition such as a medicament, 
pharmaceutical composition or drug. These may be administered to individuals. 

Thus, the present invention extends in various aspects not only to a substance identified 
using a nucleic acid molecule as a modulator of polypeptide activity, in accordance with what is 

30 disclosed herein, but also a pharmaceutical composition, medicament, drug or other composition 
comprising such a substance, a method comprising administration of such a composition 
comprising such a substance, a method comprising administration of such a composition to a 
patient, e.g., for treatment or prophylaxis of diabetes, use of such a substance in the manufacture 
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of a composition for administration, e.g., for treatment or prophylaxis of diabetes, and a method 
of making a pharmaceutical composition comprising admixing such a substance with a 
pharmaceutical^ acceptable excipient, vehicle or carrier, and optionally other ingredients. 

A substance identified as a modulator of polypeptide function may be peptide or non- 
5 peptide in nature. Non-peptide "small molecules" are often preferred for many in vivo pharma- 
ceutical uses. Accordingly, a mimetic or mimic of the substance (particularly if a peptide) may 
be designed for pharmaceutical use. 

The designing of mimetics to a known pharmaceutical^ active compound is a known 
approach to the development of pharmaceuticals based on a "lead" compound. This might be 

10 desirable where the active compound is difficult or expensive to synthesize or where it is 
unsuitable for a particular method of administration, e.g., pure peptides are unsuitable active 
agents for oral compositions as they tend to be quickly degraded by proteases in the alimentary 
canal. Mimetic design, synthesis and testing is generally used to avoid randomly screening large 
numbers of molecules for a target property. 

15 There are several steps commonly taken in the design of a mimetic from a compound 

having a given target property. First, the particular parts of the compound that are critical and/or 
important in determining the target property are determined. In the case of a peptide, this can be 
done by systematically varying the amino acid residues in the peptide, e.g., by substituting each 
residue in turn. Alanine scans of peptide are commonly used to refine such peptide motifs. 

20 These parts or residues constituting the active region of the compound are known as its 
"pharmacophore". 

Once the pharmacophore has been found, its structure is modeled according to its 
physical properties, e.g., stereochemistry, bonding, size and/or charge, using data from a range 
of sources, e.g., spectroscopic techniques, x-ray diffraction data and NMR. Computational 
25 analysis, similarity mapping (which models the charge and/or volume of a pharmacophore, 
rather than the bonding between atoms) and other techniques can be used in this modeling 
process. 

In a variant of this approach, the three-dimensional structure of the ligand and its binding 
partner are modeled. This can be especially useful where the ligand and/or binding partner 
30 change conformation on binding, allowing the model to take account of this in the design of the 
mimetic. 

A template molecule is then selected onto which chemical groups which mimic the 
. pharmacophore can be grafted. The template molecule and the chemical groups grafted onto it 
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can conveniently be selected so that the mimetic is easy to synthesize, is likely to be 
pharmacologically acceptable, and does not degrade in vivo, while retaining the biological 
activity of the lead compound. Alternatively, where the mimetic is peptide-based, further 
stability can be achieved by cyclizing the peptide, increasing its rigidity. The mimetic or 

5 mimetics found by this approach can then be screened to see whether they have the target 
property, or to what extent they exhibit it. Further optimization or modification can then be 
carried out to arrive at one or more final mimetics for in vivo or clinical testing. 
Methods of Use: Nucleic Acid Based Therapies 

According to the present invention, a method is also provided of supplying wild-type 

10 AGT function to a cell which carries mutant AGT alleles. The wild-type AGT gene or a part of 
the gene may be introduced into the cell in a vector such that the gene remains 
extrachromosomal. In such a situation, the gene will be expressed by the cell from the 
extrachromosomal location. If a gene fragment is introduced and expressed in a cell carrying a 
mutant AGT allele, the gene fragment should encode a part of the AGT protein which is 

15 required for normal physiological processes of the cell. More preferred is the situation where 
the wild-type AGT gene or a part thereof is introduced into the mutant cell in such a way that it 
recombines with the endogenous mutant AGT gene present in the cell. Such recombination 
requires a double recombination event which results in the correction of the AGT gene mutation. 
Vectors for introduction of genes both for recombination and for extrachromosomal 

20 maintenance are known in the art, and any suitable vector may be used. Methods for 
introducing DNA into cells such as electroporation, calcium phosphate coprecipitation and viral 
transduction are known in the art, and the choice of method is within the competence of the 
routineer. See also U.S. Patent Nos. 5,800,998 and 5,891,628, each incorporated by reference 
herein. 

25 Among the compounds which may exhibit anti-diabetes activity are antisense, ribozyme, 

and triple helix molecules. Such molecules may be designed to reduce or inhibit mutant AGT 
activity. Techniques for the production and use of such molecules are well known to those of 
skill in the art, such as described herein or in U.S. Patent No. 5,800,998, incorporated herein by 
reference. 

30 Antisense RNA and DNA molecules act to directly block the translation of mRNA by 

binding to targeted mRNA and preventing protein translation. With respect to antisense DNA, 
oligodeoxyribonucleotides derived from the translation initiation site, e.g., between the -10 and 
+10 regions of the AGT nucleotide sequence of interest, are preferred. 
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Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence specific hybridization of the 
ribozyme molecule to complementary target RNA, followed by a endonucleolytic cleavage. The 
composition of ribozyme molecules must include one or more sequences complementary to the 
5 target AGT mRNA, preferably the mutant AGT mRNA, and must include the well known 
catalytic sequence responsible for mRNA cleavage. For this sequence, see U.S. Pat. No. 
5,093,246, which is incorporated by reference herein in its entirety. As such, within the scope of 
the invention are engineered hammerhead motif ribozyme molecules that specifically and 
efficiently catalyze endonucleolytic cleavage of RNA sequences encoding AGT, preferably 

1 0 mutant AGT proteins. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified 
by scanning the target molecule for ribozyme cleavage sites which include the following 
sequence: GUA, GUU and GUC. Once identified, short RNA sequences of between 15 and 
ribonucleotides corresponding to the region of the target gene containing the cleavage site may 

15 be evaluated for predicted structural features, such as secondary structure, that may render the 
oligonucleotide sequence unsuitable. The suitability of candidate targets may also be evaluated 
by testing their accessibility to hybridization with complementary oligonucleotides, using 
ribonuclease protection assays. 

Nucleic acid molecules to be used in triplex helix formation should be single stranded 

20 and composed of deoxynucleotides. The base composition of these oligonucleotides must be 
designed to promote triple helix formation via Hoogsteen base pairing rules, which generally 
require sizeable stretches of either purines or pyrimidines to be present on one strand of a 
duplex. Nucleotide sequences may be pyrimidine-based, which will result in TAT and 
CGC.sup.+ triplets across the three associated strands of the resulting triple helix. The 

25 pyrimidine-rich molecules provide base complementarity to a purine-rich region of a single 
strand of the duplex in a parallel orientation to that strand. In addition, nucleic acid molrcules 
may be chosen that are purine-rich, for example, contain a stretch of guanidine residues. These 
molecules will form a triple helix with a DNA duplex that is rich in GC pairs, in which the 
majority of the purine residues are located on a single strand of the targeted duplex, resulting in 

30 GGC triplets across the three strands in the triplex. 

Alternatively, the potential sequences that can be targeted for triple helix formation may 
be increased by creating a so called "switchback" nucleic acid molecule. Switchback molecules 
. are synthesized in an alternating 5'-y, 3 f -5 r manner, such that they base pair with one strand of a 
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duplex first and then the other, eliminating the necessity for a sizeable stretch of either purines 
or pyrimidines to be present on one strand of a duplex. 

It is possible that the antisense, ribozyme, and/or triple helix molecules described herein 
may reduce or inhibit the translation of mRNA produced by both normal and mutant A<3T 
5 alleles. In order to ensure that substantial normal levels of AGT activity are maintained in the 
cell, nucleic acid molecules that encode and express AGT polypeptides exhibiting normal AGT 
activity may be introduced into cells which do not contain sequences susceptible to whatever 
antisense, ribozyme, or triple helix treatments. Such sequences may be introduced via gene 
therapy methods. Alternatively, it may be preferable to coadminister normal AGT protein into 
1 0 the cell or tissue in order to maintain the requisite level of cellular or tissue AGT activity. 

Antisense RNA and DNA molecules, ribozyme molecules and triple helix molecules of 
the invention may be prepared by any method known in the art for the synthesis of DNA and 
RNA molecules. These include techniques for chemically synthesizing 
oligodeoxyribonucleotides and oligoribonucleotides well known in the art such as for example 
15 solid phase phosphoramidite chemical synthesis. Alternatively, RNA molecules may be 
generated by in vitro and in vivo transcription of DNA sequences encoding the antisense RNA 
molecule. Such DNA sequences may be incorporated into a wide variety of vectors which 
incorporate suitable RNA polymerase promoters such as the T7 or SP6 polymerase promoters. 
Alternatively, antisense cDNA constructs that synthesize antisense RNA constitutively or 
20 inducibly, depending on the promoter used, can be introduced stably into cell lines. 

Various well-known modifications to the DNA molecules may be introduced as a means 
of increasing intracellular stability and half-life. Possible modifications include, but are not 
limited to, the addition of flanking sequences of ribo- or deoxy- nucleotides to the 5' and/or 3' 
ends of the molecule or the use of phosphorothioate or T O-methyl rather than 
25 phosphodiesterase linkages within the oligodeoxyribonucleotide backbone. 

Gene therapy would be carried out according to generally accepted methods, for 
example, as described , in further detail in U.S. Patent Nos. 5,837,492 and 5,800,998 and 
references cited therein, all incorporated by reference herein. Expression vectors in the context 
of gene therapy are meant to include those constructs containing sequences sufficient to express 
30 a polynucleotide that has been cloned therein. In viral expression vectors, the construct contains 
viral sequences sufficient to support packaging of the construct. If the polynucleotide encodes 
an antisense polynucleotide or a ribozyme, expression will produce the antisense polynucleotide 
or ribozyme. Thus in this context, expression does not require that a protein product be 
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synthesized. In addition to the polynucleotide cloned into the expression vector, the vector also 
contains a promoter functional in eukaryotic cells. The cloned polynucleotide sequence is under 
control of this promoter. Suitable eukaryotic promoters include those described above. The 
expression vector may also include sequences, such as selectable markers and other sequences 
5 conventionally used. 

Methods of Use: Peptide Therapy 

Peptides which have AGT activity can be supplied to cells which cany mutant or 
missing AGT alleles. Peptide therapy is performed as described herein or using well known 
techniques, such as described in U.S. Patent Nos. 5,800,998 and 5,891,628, each incorporated 
10 herein by reference. 

Protein can be produced by expression of the cDNA sequence in bacteria, for example, 
using known expression vectors. Alternatively, AGT polypeptide can be extracted from AGT- 
producing mammalian cells. In addition, the techniques of synthetic chemistry can be employed 
to synthesize AGT protein. Any of such techniques can provide the preparation of the present 
15 invention which comprises the AGT protein. Preparation is substantially free of other human 
proteins. This is most readily accomplished by synthesis in a microorganism or in vitro. 

Active AGT molecules can be introduced into cells by microinjection or by use of 
liposomes, for example. Alternatively, some active molecules may be taken up by cells, actively 
or by diffusion. Extracellular application of the AGT gene product may be sufficient to affect the 
20 development and or progression of diabetes.. Supply of molecules with AGT activity should 
lead to partial reversal of thediabetic phenotype. Other molecules with AGT activity (for 
example, peptides, drugs or organic compounds) may also be used to effect such a reversal. 
Modified polypeptides having substantially similar function are also used for peptide therapy. 

Alternatively, antibodies that are both specific for mutant AGT gene product and 
25 interfere with its activity may be used. Such antibodies may be generated using standard 
techniques described herein or using conventional techniques, such as described in U.S. Patent 
.Nos. 5,837,492; 5,800,998 and 5,891,628, against the proteins themselves or against peptides 
corresponding to the binding domains of the proteins. Such antibodies include but are not 
limited to polyclonal, monoclonal, Fab fragments, F(ab').sub.2 fragments, single chain 
30 antibodies, chimeric antibodies, humanized antibodies etc. 

Methods of Use: Agonists or Antagonists for Diabetes Prevention or Treatment 

Mutations in angiotensinogen predisposing to diabetes suggest that the protein or peptide 
fragments of the protein have roles in the development or continued functioning of the beta islet 
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cells of the pancreas. It is known from the literature that mutation of Cysl8 leads to a reduction 
of the cleavage of angiotensinogen by renin (Rosaet al., 1998). However, there are a series of 
experiments which would be necessary in order to address angiotensinogen' s role in the disease 
and to identify points for pharmacological intervention. 

5 The only known peptide derived by cleavage of angiotensinogen with renin is 

angiotensin 1. Angiotensin 1 (AI) is then subsequently cleaved by angiotensin converting 
enzyme (ACE) to angiotensin 2 (All) which is, itself, metabolized to angiotensin 3 (AIII) and 
smaller peptides. Two human receptors for angiotensin 2, ATI and AT2, are known in the 
literature, but it has been reported that other receptors for these active metabolites exist (for 

10 example, Kozlowski et al., 1993), and can be molecularly cloned. These receptors may have a 
biological distribution that includes the pancreas and/or be selective for one or more of the 
angiotensin 1 metabolites. Such receptors represent good targets for pharmacological 
intervention in diabetes. Equally, agonists, for example AI, All, AIII or synthetic derivatives 
such as those described by Samanen et al. (1991), or antagonists of the known angiotensin 

15 receptors (e.g., losartan) represent drugs for use in the treatment or prevention of diabetes. After 
the initial cleavage by renin a large polypetide remains. The breakdown of this polypeptide into 
other active fragments can be envisioned. Agonist and antagonists of the receptors for these 
fragments as well as the proteases responsible for there formation also represent drugs for the 
treatment or prevention of diabetes. 

20 Methods of Use: Identification of Functional Peptides Derived From AGT 

Nothing, in current literature, is known of the metabolic fate of the remainder of 
angiotensinogen once angiotensin 1 is cleaved off by renin. It is possible that this protein has a 
direct biological function or be cleaved to bio-active peptides. The existence of such peptides 
can be confirmed by using a panel of polyclonal antibodies which bind to angiotensingen and 

25 use these antibodies to look for immuno-reactive peptide fragments in human or other animal 
sera. Once such peptides are identified, their cognate receptors can be cloned using standard 
molecular biology approaches. Another approach would be to screen for receptor partners for 
random fragments of angiotensinogen using the yeast two hybrid system or other techniques to 
discover protein/protein interactions. The peptides, identified by any approach, may, 

30 themselves, have utility as drugs for the treatment of diabetes or their receptors may represent 
points for pharmacological intervention in diabetes. In addition the inhibitors or activators of 
the protease (s) responsible for the cleavage of the full length AGT protein represnet another 
opportunity to develop drugs to treat or prevent diabetes. 
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Methods of Use: Transformed Hosts; Transeenic/Knockout Animals and Models 

Similarly, cells and animals which carry a mutant AGT allele can be used as model 
systems to study and test for substances which have potential as therapeutic agents. These may 
be isolated from individuals with AGT mutations, either somatic or germline. Alternatively, the 
5 cell line can be engineered to carry the mutation in the AGT allele, as described above. After a 
test substance is applied to the cells, the phenotype of the cell is determined. Any trait of the 
transformed cells can be assessed using techniques well known in the art. Transformed hosts, 
transgenic/knockout animals and models are prepared and used as described herein or using well 
known techniques, such as described in U.S. Patent Nos. 5,800,998 and 5,891,628, each 

10 incorporated herein by reference. 

Animals for testing therapeutic agents can be selected after mutagenesis of whole 
animals or after treatment of germline cells or zygotes. Such treatments include insertion of 
mutant AGT alleles, usually from a second animal species, as well as insertion of disrupted 
homologous genes. Alternatively, the endogenous AGT gene(s) of the animals may be disrupted 

15 by insertion or deletion mutation or other genetic alterations using conventional techniques 
(Capecchi, 1989; Valancius and Smithies, 1991; Hasty et al $ 1991; Shinkai et al y 1992; 
Mombaerts et al 9 1992; Philpott et a/., 1992; Snouwaert et a/., 1992; Donehower et aL, 1992) to 
produce knockout or transplacement animals. A transplacement is similar to a knockout because 
the endogenous gene is replaced, but in the case of a transplacement the replacement is by 

20 another version of the same gene. After test substances have been administered to the animals, 
the the diabetic phenotypemust be assessed. If the test substance prevents or suppresses thethe 
diabetic phenotype, then the test substance is a candidate therapeutic agent for the treatment of 
diabetes.. These animal models provide an extremely important testing vehicle for potential 
therapeutic products. 

25 In one embodiment of the invention, transgenic animals are produced which contain a 

functional transgene encoding a functional AGT polypeptide or variants thereof. Transgenic 
animals expressing AGT transgenes, recombinant cell lines derived from such animals and 
transgenic embryos may be useful in methods for screening for and identifying agents that 
induce or repress function of AGT. Transgenic animals of the present invention also can be used 
30 as models for studying indications such as IDDM. 

In one embodiment of the invention, a AGT transgene is introduced into a non-human 
host to produce a transgenic animal expressing a human, murine or other species AGT gene. 
. The transgenic animal is produced by the integration of the transgene into the genome in a 
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manner that permits the expression of the transgene. Methods for producing transgenic animals 

are generally described by Wagner and Hoppe (U.S. Patent No. 4,873,191; which is 

incorporated herein by reference), Brinster et al 1985; which is incorporated herein by reference 

in its entirety) and in "Manipulating the Mouse Embryo; A Laboratory Manual" 2nd edition 
5 (eds., Hogan, Beddington, Costantimi and Long, Cold Spring Harbor Laboratory Press, 1994; 

which is incorporated herein by reference in its entirety). 

It may be desirable to replace the endogenous AGT by homologous recombination 

between the transgene or a mutant gene and the endogenous gene; or the endogenous gene may 

be eliminated by deletion as in the preparation of "knock-out" animals. Typically, a AGT gene 
10 flanked by genomic sequences is transferred by microinjection into a fertilized egg. The 

microinjected eggs are implanted into a host female, and the progeny are screened for the 

expression of the transgene. Transgenic animals may be produced from the fertilized eggs from 

a number of animals including, but not limited to reptiles, amphibians, birds, mammals, and fish. 

Within a particularly preferred embodiment, transgenic mice are generated which overexpress 
15 AGT or express a mutant form of the polypeptide. Alternatively, the absence of a AGT in 

"knock-out" mice permits the study of the effects that loss of AGT protein has on a cell in vivo. 

Knock-out mice also provide a model for the development of AGT-related diabetes. 

Methods for producing knockout animals are generally described by Shastry (1995, 

1998) and Osterrieder and Wolf (1998). The production of conditional knockout animals, in 
20 which the gene is active until knocked out at the desired time is generally described by Feil et al. 

(1996), Gagneten et al. (1997) and Lobe and Nagy (1998). Each of these references is 

incorporated herein by reference. 

As noted above, transgenic animals and cell lines derived from such animals may find 

use in certain testing experiments. In this regard, transgenic animals and cell lines capable of 
25 expressing wild-type or mutant AGT may be exposed to test substances. These test substances 

can be screened for the ability to alter expression of wild-type AGT or alter the expression or 

function of mutant AGT. 

Pharmaceutical Compositions and Routes of A dministration 

The AGT polypeptides, antibodies, peptides and nucleic acids of the present invention 
30 can be formulated in pharmaceutical compositions, which are prepared according to conventional 
pharmaceutical compounding techniques. See, for example, Remington's Pharmaceutical 
Sciences, 18th Ed. (1990, Mack Publishing Co., Easton, PA). The composition may contain the 
active agent or pharmaceutical^ acceptable salts of the active agent. These compositions may 
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comprise, in addition to one of the active substances, a pharmaceutically acceptable excipient, 
carrier, buffer, stabilizer or other materials well known in the art. Such materials should be non- 
toxic and should not interfere with the efficacy of the active ingredient. The carrier may take a 
wide variety of forms depending on the form of preparation desired for administration, e.g., 

5 intravenous, oral, intrathecal, epineural or parenteral. 

For oral administration, the compounds can be formulated into solid or liquid 
preparations such as capsules, pills, tablets, lozenges, melts, powders, suspensions or emulsions. 
In preparing the compositions in oral dosage form, any of the usual pharmaceutical media may 
be employed, such as, for example, water, glycols, oils, alcohols, flavoring agents, preservatives, 

10 coloring agents, suspending agents, and the like in the case of oral liquid preparations (such as, 
for example, suspensions, elixirs and solutions); or carriers such as starches, sugars, diluents, 
granulating agents, lubricants, binders, disintegrating agents and the like in the case of oral solid 
preparations (such as, for example, powders, capsules and tablets). Because of their ease in 
administration, tablets and capsules represent the most advantageous oral dosage unit form, in 

15 which case solid pharmaceutical carriers are obviously employed. If desired, tablets may be 
sugar-coated or enteric-coated by standard techniques. The active agent can be encapsulated to 
make it stable to passage through the gastrointestinal tract while at the same time allowing for 
passage across the blood brain barrier. See for example, WO 96/1 1698. 

For parenteral administration, the compound may be dissolved in a pharmaceutical 

20 carrier and administered as either a solution or a suspension. Illustrative of suitable carriers are 
water, saline, dextrose solutions, fructose solutions, ethanol, or oils of animal, vegetative or 
synthetic origin. The carrier may also contain other ingredients, for example, preservatives, 
suspending agents, solubilizing agents, buffers and the like. When the compounds are being 
administered intrathecally, they may also be dissolved in cerebrospinal fluid. 

25 The active agent is preferably administered in a therapeutically effective amount. The 

actual amount administered, and the rate and time-course of administration, will depend on the 
nature and severity of the condition being treated. Prescription of treatment, e.g. decisions on 
dosage, timing, etc., is within the responsibility of general practitioners or specialists, and 
typically takes account of the disorder to be treated, the condition of the individual patient, the 

30 site of delivery, the method of administration and other factors known to practitioners. 
Examples of techniques and protocols can be found in Remington *s Pharmaceutical Sciences. 

Alternatively, targeting therapies may be used to deliver the active agent more 
specifically to certain types of cell, by the use of targeting systems such as antibodies or cell 
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specific ligands. Targeting may be desirable for a variety of reasons, e.g. if the agent is 
unacceptably toxic, or if it would otherwise require too high a dosage, or if it would not 
otherwise be able to enter the target cells. 

Instead of administering these agents directly, they could be produced in the target cell, 

5 e.g. in a viral vector such as described above or in a cell based delivery system such as described 
in U.S. Patent No. 5,550,050 and published PCT application Nos. WO 92/19195, WO 94/25503, 
WO 95/01203, WO 95/05452, WO 96/02286, WO 96/02646, WO 96/40871, WO 96/40959 and 
WO 97/12635, designed for implantation in a patient. The vector could be targeted to the 
specific cells to be treated, or it could contain regulatory elements which are more tissue specific 

10 to the target cells. The cell based delivery system is designed to be implanted in a patient's body 
at the desired target site and contains a coding sequence for the active agent Alternatively, the 
agent could be administered in a precursor form for conversion to the active form by an 
activating agent produced in, or targeted to, the cells to be treated. See for example, EP 
425,731 A and WO 90/07936. 

15 The practice of the present invention employs, unless otherwise indicated, conventional 

techniques of chemistry, molecular biology, microbiology, recombinant DNA, genetics, 
immunology, cell biology, cell culture and transgenic biology, which are within the skill of the 
art. See, e.g., Maniatis et al, 1982; Sambrook et al, 1989; Ausubel et al, 1992; Glover, 1985; 
Anand, 1992; Guthrie and Fink, 1991; Harlow and Lane, 1988; Jakoby and Pastan, 1979; 

20 Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Transcription And 
Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells <R. I. Freshney, 
Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A 
Practical Guide To Molecular Cloning (1984); the treatise, Methods In Enzymology (Academic 
Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos 

25 eds., 1987, Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 and 155 (Wuet 
al. eds.), Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds., 
Academic Press, London, 1987); Handbook Of Experimental Immunology, Volumes I-IV (D. M. 
Weir and C. C. Blackwell, eds., 1986); Hogan et al., Manipulating the Mouse Embryo, (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986).. A general discussion of 

30 techniques and materials for human gene mapping, including mapping of human chromosome 1, 
is provided, e.g., in White and Lalouel, 1988. 
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EXAMPLES 

The present invention is described by reference to the following Examples, which are 
offered by way of illustration and are not intended to limit the invention in any manner. 
Standard techniques well known in the art or the techniques specifically described below were 
5 utilized. 

EXAMPLE 1 
Methods 

Subjects, genotvping. and haplotvpe analyses. The families used in this study are part of 

10 a collection of IDDM families at the Human Biological Data Interchange (HBDI), Philadelphia, 
PA. Genomic DNA samples were ordered from the HBDI Repository located at the Coriell 
Institute for Medical Research in Camden, NJ. In the initial part of this study samples were 
ordered for individuals from 38 nuclear families with 3 siblings affected with IDDM. After 
identification of sequence variants in this first set, of families, an additional 308 unrelated 

15 individuals with IDDM were obtained from HBDI. The initial control set of 120 samples were 
the grandparents of the Utah families included in the Centre d'Etude du Polymorphisme Humain 
(CEPH) database. Unrelated spouses from other Utah families were used as additional controls. 
No information on the presence or absence of diabetes in these samples was available. 
Genotypes using fluorescently labelled primers were determined for 1 1 simple tandem repeat 

20 polymorphisms spanning the published IDDM linkage on chromosome lq (Concannon et aL, 
1998). Marker positions and primer sequences were obtained from Genethon (D1S237, D1S469, 
D1S439, D1S479, D1S437, D1S225, D1S251, D1S1568, AFMa072xa5xl), Marshfield 
(D1S320), and CHLC (D1S1621). Haplotype analysis was performed by using the program 
MCLINK (Thomas et al., 1999). Comparisons of observed frequencies of mutation in cases and 

25 controls were made with a chi-squared, log likelihood ratio, test. The null hypothesis is that the 
observations come from Binomial distributions with the same probability parameters. The p- 
value quoted for each comparison is that for rejection of the null hypothesis in favor of the 
alternative hypothesis that the Binomials have unequal probability parameters. 

Genomic sequencing and mutation detection. A human genomic BAG library (Research 

30 Genetics) was screened by PCR for clones containing the marker WI18298. The end sequences 
of BAC 1-30 were used as probes to assemble a BAG contig. 6x genomic sequencing coverage 
of BACs 1-30, 1-104 and 1-112 was obtained by sequencing of plasmid libraries constructed 
with randomly sheared and &m3A-digested BAC DNA. For mutation detection exons were 
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amplified from genomic DNA v/ith primers placed in intronic sequences flanking each exon. 
Nested amplifications were performed with primers containing M13 forward (5'- 
GTTTTCCC AGTC ACG ACG-3 ' ; SEQ ID NO:9) and reverse (5 ' - AGG AAAC AGCTATG AC 
CAT-3'; SEQ ID NO: 10) universal primer sequences at the 5' termini. Nested PCR products 
5 were cycle sequenced directly and run on ABI 377 automated -sequencers (Perkin-Elmer). 
Sequence data were analyzed for polymorphisms using software developed at Myriad Genetics, 
Inc. 

GenBank accession numbers. Human AGT retina EST, AA059356; human AGT 
M24689; rat AGT L00094; sheep AGT D17520; mouse AGT AF045887; gorilla AGT 188488; 
1 0 chimpanzee AGT 1 88487. The sequences of AGT for gorilla and chimpanzee were generated in 
this study. 

EXAMPLE 2 
Association of AGT Gene and IDDM 

1 5 To investigate whether mutations in AGT are responsible for the reported linkage of type 

1 diabetes to the chromosomal region lq42-43, we carried out a three stage analysis. Initially 
we obtained a set of 38 nuclear families, each with three children affected with type 1 diabetes 
from the Human Biological Data Interchange. These were screened to identify potentially 
deleterious mutations. We then screened independent sets of 308 unrelated cases and 1003 

20 controls to determine whether the identified mutations were more common in cases than 
controls. Finally we screened all 308 cases and 631 of the 1003 controls to determine whether 
rare polymorphisms, potential causal mutations, were more common in cases than controls. In 
order to select from the 38 initial families those that could potentially segregate a mutation 
among all affected members, each family was genotyped using 11 microsatellite markers 

25 spanning a 32 cM region surrounding the AGT locus on chromosome 1 (Fig. 1). This revealed 
that in 20 of the 38 families the affected individuals shared a haplotype in part of the 32 cM 
region. 

The entire coding sequence, intron-exon boundaries, and 5' and 3' untranslated regions 
of AGT were screened for sequence variants in the genomic DNA of one affected member from 
30 each of the 20 nuclear families. Genomic DNA was amplified using primers positioned in 
introns flanking each exon of AGT (Table 1). Three families each contained a unique 
polymorphism in AGT, which we later determined segregated into all affected individuals in the 
family. 
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In family 24, a T to C transition was found in exon 2 at nucleotide 190 in the cDNA, 
which resulted in a substitution of arginine for cysteine at position 18 in the angiotensinogen 
mature peptide (Fig. 2a; Fig. 3a; 3b). Cysl8 is one of two conserved cysteine residues in 
mammalian angiotensinogen(s) (Fig. 3) that has been shown in vitro to form a disulfide bond 

5 with Cysl38 (Gimenez-Roqueplo et al., 1998a; Streatfeild-James et al., 1998). Disruption of 
disulfide bond formation might affect an unknown role for A GT in diabetes. Although it seems 
likely that disulfide bond formation contributes to a structurally constrained N-terminus 
(Streatfeild-James et al., 1998), disruption of the disulfide bond by in vitro mutagenesis of 
Cysl8 did not affect the rate of angiotensinogen cleavage by renin (Gimenez-Roqueplo et al., 

10 1998a). However, under physiological conditions, alteration of the protein's structure might 
disrupt association with other binding partners that would then indirectly affect cleavage by 
renin (Inoue et al., 1997). It is also possible that mutation of Cysl8 indirectly affects 
glycosylation of angiotensinogen. The partner with which Cysl8 forms a disulfide bond, 
Cysl38, lies in the middle of the N-glycosylation site Asnl37-Cysl38-Thrl39 (Fig. 3b). Of the 

15 four N-glycosylation sites in angiotensinogen, Asnl37 is the least glycosylated (Gimenez- 
Roqueplo et al., 1998b). In the absence of disulfide bond formation at Cysl38, Asnl37 might 
be glycosylated more frequently. Mutations in AGT that lead to increased glycosylation affect 
the amount of angiotensinogen secreted, and therefore indirectly the amount of Aug I (the 
peptide cleaved from AGT by renin and further converted to Ang II) produced (Gimenez- 

20 Roqueplo et al., 1996). Consequently, Cysl8Arg would cause a decrease in Ang I formation. 

Mutations detected in family 19 and family 30 were both in a 5' untranslated region of 
AGT. An alternative 5' exon of AGT has been reported in the EST database. This alternate 
form was verified by PCR and sequencing in this study. We determined from our BAC contig 
that the alternative exon lies at least lOOkb upstream of the published exon I (Fig. 1, Fig. 3), and 

25 it was only detected by PCR in retina cDNA. The alternative exon that splices into exon II does 
not contain an open reading frame that extends out of the exon. Translation of the alternative 
transcript is predicted to begin at the first methionine in exon II (Fig. 3a) as previously reported 
(Gaillard et al., 1989). Although the alternative transcript produces the same angiotensinogen 
polypeptide as the published AGT cDNA, its expression could be regulated in a tissue-specific 

30 manner. It is not known what effect variants in the 5' untranslated region would have on the 
expression or stability of the transcript. A C to T transition at nucleotide 96 in the exon was 
seen in all affected individuals in family 19 (Fig. 2b; Fig. 3a). In family 30, a T to C transition 
at nucleotide 286 in this exon segregated with the disease (Fig. 2c; Fig. 3a). 
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Family 30 also contained a C to T transition at nucleotide 928 in exon 3, resulting in a 
non-conservative substitution of serine for proline at position 264 (Fig. 3a, 3c). This position is 
a proline in hominoids and a hydrophobic amino acid in other non-primate species (Fig. 3c); 
therefore the substitution of a hydrophilic amino acid (serine) is expected to have profound 

5 structural consequences. The Pro264Ser missense mutation is linked to a silent G to A transition 
at the preceding nucleotide, as well as to a C to T transition at -81 1 in the upstream genomic 
sequence (shown in SEQ ID NO:l). This haplotype of three linked polymorphisms (C-811T, 
G927A and Pro264Ser) did not segregate into all the affected individuals in family 30 (Fig. 2d, 
2e). However, this is not unexpected given that type 1 diabetes is a complex trait, and that 

10 predisposition to the disease probably involves a combination of mutations in multiple genes. If 
carrying a subset of these mutated genes results in the diabetic phenotype, members of the same 
family that do not share at the AGT locus could have an overlapping set of mutations in other 
gene(s). 



15 EXAMPLE 3 

Identification of Additional Polymorphisms 
If the AGT polymorphisms identified in Example B are truly predisposing mutations, 
they should be detected in other type 1 diabetic individuals and should be rare or absent in 
controls. To identify additional mutations, a set of DNA samples from 308 unrelated type 1 

20 diabetic individuals was screened for sequence variants in AGT. These cases were selected 
from 308 additional families in the Human Biological Data Interchange. A set of control 
samples was screened in the same way for statistical comparison (see Example 1 and Table 2). 
The Cysl8Arg mutation was observed three times in the new case set. It segregated with the 
disease in family 164, but not in families 214 and 344 (Table 2). The Cysl8Arg mutation was 

25 observed in only 1 of 1003 control samples, indicating that it is enriched in the disease set (3 of 
308 vs. 1 of 1003,/? =0.03). 
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In the new case set the Pro264Ser mutation was seen once, with its accompanying 
polymorphisms, but was not seen in the controls (1/308 vs. 0/1003, p =0.09). The C-811T 
polymorphism was observed once in the controls but in isolation, not as part of the C-811T, 
G927A and Pro264Ser haplotype. The G297A change was not seen in the controls. 

5 A number of novel polymorphisms in AGT were detected in the second set of unrelated 

diabetic individuals. Segregation analyses showed that about 50% of the polymorphisms 
segregated into all affected individuals within a family (Table 2). An individual in family 54 
contained a G to T transversion at nucleotide 26 in alternative exon I. Within the genomic 
sequence upstream of exon I, the sequence variants A(-715)G, T(-363)A, G(-259)A, T(-253)C 

10 and G(-183)A were detected in individuals from families 266, 162, 339, 236 and 150, 
respectively. Three other polymorphisms in the upstream genomic sequence, T(-741)C, AG(- 
603) and C(-281)T, were linked in family 1 14. Although each of these changes lies upstream of 
the characterized proximal promoter in AGT (Brasier et al., 1999), they could potentially alter 
AGT expression by affecting promoter elements not yet identified. Recently, Yanai et al. (1999) 

15 have reported a role for hepatocyte nuclear factor 4 (HNF4) and chicken ovalabumin upstream 
pormoter transcription factor (COUP-TFII) in regulating transcription of the angiotensinogen 
gene. Truncating mutations in the HNF4 gene were found in patients with maturity onset 
diabetes of the youn, and the HNF4 gene is therefor also called MODY1 (Yamagata et al., 1996; 
Lidner et al., 1997). One of the DNA fragments identified as an HNF4 activating element by 

20 deletion analysis is the fragment from -242 to -295. Three of the variants in the sequence 
upstream of exon 1 are within this fragment (T(-253)C in family 236, G(-259)A in family 339 
and C(-281)T in family 1 14). None of these variants was detected in 63 1 controls. 

Three previously unreported missense mutations were detected in the coding sequence of 
AGT. An Alal22Val missense mutation was detected in family 305, an Asp3 19Asn was present 

25 in family 3 18, and an Ala401 Val was found in family 272. Of these, only Alal22Val segregated 
with the disease in the family. Considering the rare missense mutations which were detected in 
AGT 9 five are at positions highly conserved in primates (Cysl8Arg, Alal22Val, Pro264Ser, 
Asp319Asn, and Ala401Val, see Fig. 3b, 3c). In the 3* UTR a G to T transversion 35 bp 
downstream of the translation stop (G1530T) was observed in an individual from family 229. 

30 The discovery of enhancer elements in exon 5 and in the genomic sequence downstream of AGT 
(Nibu et al., 1994a; Nibu et al., 1994b) suggests this change might affect ,4 expression. This 
change was detected once in 636 controls. Finally, two missense mutations that had been 
identified previously in hypertensive individuals were seen once in the set of type 1 diabetics. A 
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Tyr248Cys mutation was detected in an individual from family 348. This change was 
previously reported to affect the rate of angiotensinogen production in hypertensive individuals 
{Gimenez-Roqueplo et al., 1996). An individual in family 119 contained a Leu359Met 
substitution that had been shown previously in a hypertensive patient (Jeunemaitre et al., 1992). 

5 Both changes were also found in our control samples (Table 2). Neither of these missense 
changes segregated with diabetes in these families. 

All of the coding and nod-coding regions of AGT which were screened for 
polymorphisms in type 1 diabetics were also sequenced in 631 control individuals. Several 
polymorphisms were seen in the controls that were not seen in the cases (see Table 3). The 

10 frequency of rare changes, defined as present at less than 5 times in all samples, was compared 
in cases and controls. Fewer control samples earned rare changes when compared to type I 
diabetic cases (20 of 308 vs. 17 of 631, /t=0.007). It is interesting to note that there were no 
polymorphisms detected in the sequence upstream of exon 1 between -64 and -413 in the 631 
controls. There were five polymorphisms detected in this region in the type 1 diabetic cases 

15 including the three changes found in the fragment involved in activation of transcription of AGT 
by HNF4. 
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To conclude, the nature of the polymorphisms found in an initial set of diabetic families, 
their combined frequency in an independent set of cases compared with that in a set of controls 
(4/308 vs. 1/1003, /7=0.0O8), and the abundance of rare mutations in type I diabetic cases all 
indicate that mutations in AGT increases susceptibility to the disease. The identification of 
5 missense mutations confirms the importance of the mature part of the protein in AGT structure 
and function. Novel polymorphisms in the alternative 5' exon could provide clues to tissue- 
specific functions of the renin-angiotensin system. The role of AGT in the microvascular 
complications of diabetes (Krolewski et aL, 1998; Rogus et al., 1998) can be further considered. 
Perhaps the most interesting prospect is that our results in conjunction with the recent 
10 demonstration of angiotensin II receptors on cells of the islets of Langerhans that also stain for 
insulin, implies a direct role for AGT in pancreatic P-cell function (Tahmasebi et al., 1999). 
Understanding the contribution of AGT to type 1 diabetes could lead to new treatments and 
preventative medicines for the disease. 

15 While the invention has been disclosed in this patent application by reference to the 

details of preferred embodiments of the invention, it is to be understood that the disclosure is 
intended in an illustrative rather than in a limiting sense, as it is contemplated that modifications 
will readily occur to those skilled in the art, within the spirit of the invention and the scope of 
the appended claims. 

20 
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WHAT IS CLAIMED IS: 

1 . A nucleic acid selected from the group consisting of: 

(a) a DNA molecule. comprising the nucleotides set forth in SEQ ID NO:l, or 
complement thereof; 

(b) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:2, or 

complement thereof; 

(c) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:3, or 

complement thereof; 

(d) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:4, or 

complement thereof; 

(e) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:5, or 
complement thereof; and 

(f) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:8, or 

complement thereof. 

2. A nucleic acid selected from the group consisting of: 

(a) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:8, 
wherein G at nucleotide position 26 of the exon is substituted with T, or a complement 
thereof; 

(b) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:8, 
wherein C at nucleotide position 96 of the exon is substituted with T, or a complement 
thereof; 

(c) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:8, 
wherein T at nucleotide position 286 of the exon is substituted with C, or a complement 
thereof; 

(d) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein C at nucleotide position 2201 is substituted with T, or a complement thereof 

(e) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein T at nucleotide position 2271 is substituted with C, or a complement thereof 

(f) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein G at nucleotide position 603 is deleted, or a complement thereof 

(g) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein C at nucleotide position 273 1 is substituted with T, or a complement thereof 
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(h) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein A at nucleotide position 2297 is substituted with G, or a complement thereof 

(i) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein T at nucleotide position 2649 is substituted with A, or a complement thereof 

5 (j) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 

wherein G at nucleotide position 2753 is substituted with A, or a complement thereof 

(k) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
wherein T at nucleotide position 2759 is substituted with C, or a complement thereof 

(1) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:l, 
10 wherein G at nucleotide position 2829 is substituted with A, or a complement thereof 

(m) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 
wherein T at nucleotide position 190 is substituted with C, or a complement thereof; 

(n) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 
wherein C at nucleotide position 503 is substituted with T, or a complement thereof; 
15 ( 0 ) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 

wherein G at nucleotide position 927 is substituted with A, or a complement thereof; 

(p) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 
wherein C at nucleotide position 928 is substituted with T, or a complement thereof; 

(q) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 
20 wherein G at nucleotide position 1 093 is substituted with A, or a complement thereof; 

(r) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 
wherein C at nucleotide position 1340 is substituted with T, or a complement thereof; 
and 

(s) a DNA molecule comprising the nucleotides set forth in SEQ ID NO:7, 
25 wherein G at nucleotide position 1 529 is substituted with T, or a complement thereof; 

3. A polypeptide selected from the group consisting of: 

(a) a protein molecule comprising the amino acids set forth in SEQ ID NO:7, 
wherein Cys at position 5 1 is substituted with Arg; 
30 (b) a protein molecule comprising the amino acids set forth in SEQ ID NO:7, 

wherein Ala at position 1 55 is substituted with Val; 

(c) a protein molecule comprising the amino acids set forth in SEQ ID NO:7, 
wherein Pro at position 297 is substituted with Ser; 
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(d) a protein molecule comprising the amino acids set forth in SEQ ID NO:7, 
wherein Asp at position 352 is substituted with Asn; and 

(e) a protein molecule comprising the amino acids set forth in SEQ ID NO:7, 
wherein Ala at position 434 is substituted with Val. 



4. A method for detecting an alteration in AGT wherein said alteration is associated with 
IDDM in a human, wherein said method comprises analyzing an AGT gene or an AGT 
gene expression product from a tissue of said human. 

5. A method as claimed in claim 4 wherein the sequence of the AGT gene in said sample is 
compared with the sequence of one or more wild-type ^G7gene sequences. 

6. A method for determining whether a human subject has or is at risk for developing 
diabetes mellitus comprising the step of: 

a) obtaining a sample from a subject, said sample comprising nucleic acid 
molecules containing AGT gene; and 

b) detecting the presence or absence of a genetic polymorphism in the gene of 
said subject, wherein the presence of said genetic polymorphism identifies a subject that 
has or is at risk for developing diabetes. 

7. A method for determining whether a human subject has or is at risk for developing 
diabetes mellitus comprising the step of: 

a) obtaining a sample from a subject, said sample comprising nucleic acid 
molecules containing AGT gene; and 

b) detecting the presence or absence of a genetic polymorphism in the gene of 
said subject, wherein the presence of said genetic polymorphism identifies a subject that 
has or is at risk for developing diabetes, 

wherein said polymorphism is one of those specified in claim 2. 

8. A method for preventing or treating diabetes in a subject which comprises administering 
to the subject a therapeutically effective amount of a compound that agonizes or 
antagonizes wild-type AGT, or agonizes or antagonizes mutant AGT, or agonizes or 
antagonizes an AGT receptor. 
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The method of claim 8, wherein said compound is selected from the group consisting of: 

(a) a drug; 

(b) an antisense molecule; 

(c) a ribozyme; 

(d) a triplex molecule; 

(e) wild-type AGT nucleic acid; 

(f) wild-type AGT protein; 

(g) a protein which binds wild-type or mutant AGT protein: 

(h) a peptidomimetic; 

(i) a non-peptide, non-nucleic acid small molecule; and . 
(j) an antibody. 

A method for preventing or treating diabetes in a subject which comprises administering 
to the subject a therapeutically effective amount of a compound that agonizes or 
antagonizes wild-type AGT fragments), or agonizes orantagonizes mutant AGT, or 
agonizes or antagonizes a receptor for a fragment or fragments of AGT. 

The method of claim 10, wherein said compound is selected from the group consisting 
of: 

(a) a drug; 

(b) an antisense molecule; 

(c) a ribozyme; 

(d) a triplex molecule; 

(e) wild-type AGT nucleic acid; 

(f) wild-type AGT protein; 

(g) a protein which binds wild-type or mutant AGT protein: 

(h) a peptidomimetic; 

(i) a non-peptide, non-nucleic acid small molecule; and . 
(j) an antibody. 

A method for preventing or treating diabetes in a subject which comprises administering 
to the subject a therapeutically effective amount of a compound that agonizes or 
antagonizes a protease that cleaves AGT or any of its fragments. 
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13. The method of claim 12, wherein said compound is selected from the group consisting 
of: 

(a) a drug; 

(b) an antisense molecule; 

(c) a ribozyme; 

(d) a triplex molecule; 

(e) wild-type A GT nucleic acid; 

(f) wild-type AGT protein; 

(g) a protein which binds wild-type or mutant AGT protein: 

(h) a peptidomimetic; 

(i) a non-peptide, non-nucleic acid small molecule; and . 
(j) an antibody. 

14. A non-human animal which carries an altered AGT allele in its genome, wherein said 
allele comprises one of the polymorphisms set forth in claim 2. 

15. A cell line isolated from the non-human animal of claim 14. 

16. A method of screening for drug candidates useful in treating diabetes resulting from a 
mutation in AGT, wherein said method involves mixing a mutant AGT in both the 
presence of a drug and the absence of said drug and measuring the level of the biological 
activity of the mutant AGT, wherein if the level of the biological activity is less in the 
presence of said drug than in the absence of said drug then said drug is a drug candidate 
for treating diabetes. 

17. A method of screening for drug candidates useful in treating diabetes resulting from a 
mutation in AGT, wherein said method comprises treating an animal which is 
heterozygous or homozygous for AGT containing said mutation with a drag wherein if 
said animal does not develop diabetes then said drug is a drug candidate for treating 
diabetes. 
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A method for screening potential diabetes therapeutics which comprises: combining an 
AGT binding partner and a compound suspected of being an diabetes therapeutic and 
measuring the biological activity of the binding partner. 
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• <160> 68 



<170> Patentln Ver. 2. 


0 






<210> 1 

<211> 5308 

<212> DNA 

<213> Homo sapiens 








<220> 

<221> raise feature 
<222> (3012) . . (3047) 
<223> exon 1 of AGT gene 






<400> 1 
gccccagcca 


cttgacccaa 


cctggagctc 


acaaaacccc gtgttgtata aaagtaaggt 


£0 


ttaagggtct 


agggctgtgt 


caaagtggct 


ggggcaaagc tacaaattaa caacatctca 


120 


gcaaagcaat 


tgtttaaagt 


acagctcttt 


ttcaaaatgg agtctcttat gtcttccctt 


180 


tctatataga 


cacagtaaca 


gtctgatctc 


tctttttttt -ccctacatat cccaatagca 


240 


cctgcctgac 


taatacatca 


tgctctgctg 


actccatatg tggctggttt cctggatccg 


300 


attctgactg 


atggtatctg 


ttctcacgct 


gctaataaag acataccaga gactgggtaa 


3S0 


tttataaaga 


aaaagaggtt 


taatggactt 


atgtgttcca cgtggctggg gaggcctcac 


420 


aatcatggtg 


gaaggtgaag 


gaggagcaaa 


agcacatctt acatggtggc tggcaaaaga 


480 


gagaaagcat 


gttcagggga 


actccccttt 


ataaaaccat cagatotcat gagacttatg 


540 


cactatcaca 


agaacagcac 


cagaaagatc 


caccctcatg attcaattac ctcccacfcgg 


600 


gtocctcccg 


tgacacatgg 


gaattatggg 


agctacaatt ggagatttgg gt^gggacac 


660 


agccaaacca 


tatcagatgg 


cttatttggt 


ttctatgtag aacctctgct tttcattcaa 


720 


cagtcttcat 


ttagccacag 


ataagctctg 


tccctaactt cegctgat-gg aatgtacaca 


780 
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taagaaactt 


ccactgatgg 


aatgaacaca 


gaatctgagc 


tgggtcaaat 


gtctgcagtc 


tttacattct 


caataaaatt 


ctctagcaat 


cttacataaa 


agatccccca 


gctgatagtc 


caatcttcfca 


acaagt t t aa 


gcagatattc 






a cert at cat t 




crcrt cetera a a 


ttttattttc 


t- pant t nt- 1" 1" 
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ttcrttcacrtc 


y y L/dCLa l. i— l. 




tttcrttatta 


yuua i»u.y uyy 


a t a t nee cr t o 

aba uyu^y y 


actccttoaa 


ft* of* rprt t* c 


acrtoccctaa 


taccatgt at 


'tLCJCCtLCaCCC 

l«Vjw^> k_ w w 


actocatcac 

u- \» *H W w ui 


ttgtgatcac 


gtcatgccct 


gttataacga 


ctacaaaagc 


aaccctttta 


ctggtcatgt 


gaaacttacc 


cagcacatct 


ttcaatgect 


gcatcctgaa 


gtgctattgt 


tggtgtttaa 


cagtctcccc 


ttcacttget 


tgtgtgtttt 


ccccagtgtc 


ttggagcagc 


tgaaggtcac 


acatcccatg 



2 



gaaggtgcct 


actgggaaga 


aaactggcct 


840 


agtttgaatg 


gctgctcctt 


atgggaataa 


900 


tttctgattg 


attttaatga 


gctttaaagc 


960 


agecttggge 


atggattaag 


ggcttttaac 


1020 


tttattgggt 


ccaatctaac 


caaaattatt 


1080 


attaagagaa 


gtttggcttg 


ettagaggee 


1140 


ttttacacca 


ctttccagca 


tgttacctga 


1200 


gagtgatcat 


tccaagggcc 'cgaggcagga 


1260 


accaggatct 


ttgttttgtt 


ccctgacata 


1320 


aacaaattgt 


gaggctttaa 


agagecgaaa 


1380 


taatcagtgc 


tagatcctta 


cctccccctt 


1440 


accacttgct 


gatcctcaaa 


caactagtta 


1500 


ct act gt tec 


aactaaccca 


ttegcaggea 


1560 


aaaattaaat 


taaaattgtc 


tatttattta 


1620 


cgagtgcagt 


ggtgcaatca 


tagctcactg 


1680 


t tgaattaca 


gatgegagge 


actgtacctg 


1740 


tetaagtege 


l. y uy y y y l-uc 


pa naraaflt 1 a 
^dyauaavj ^y 


1800 


aaataattaa 


aaaaatgtat 


ttcagaattt 


1860 


atgaagattt 


agagtaatgg 


gtaaaaaaga 


1920 


agtgaatgta 


cagcttctgc 


ctcatatcca 


1980 


aatgttccat 


tcctgggtaa 


tttcatgtct 


2040 


cctgcttgtg 


ttgaagcagg 


atcttccttc 


2100 


ttaaggctgg 


acacatcacc 


actcccaacc 


2160 


tggcttctgg 


cgactctcac 


caaggtctct 


2220 


aagtcttacc 


tataggaaaa 


taagaattat 




atttgeaatt 


tgtacagcat 


aaacacagaa 


2340 


ggcattttgt 


ttgtgtcttt 


caatctggct 


2400 


agctacactg 


gaaacttcca 


gaaggcactt 


2460 


tattagaggc 


etttgeacag 


ggtaggctct 


2520 


agegggcage 


agggtcagaa 


gtggcccccg 


2580 
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tgttgcctaa gcaagactct cccctgccct 
ctgtggcctc ttgggggtac atctcccggg 
caggctgtca cacacctagg gagatgctcc 
caaacttcgg taaatgtgta actcgaccct 
tctgcatcga tcactaagac ttcctggaag 
ctgtccttct ggccagcctg tggtctggcc 
cacaggcagc ctgggaacag ctccatcccc 
ccggccaggg gaagaagctg ccgttgttct 
cccctcagct ccttctcggc cttgtctctc 
aaaaggcccg ggaggaggca cggtgatgac 
tcatccggac gcgggctttc ccccactcgg 
ctcctcctca gcaataggac aagatggtca 
gtcctcagga acggagcaga ctggtacagg 
ggtcctcttg ccatcaggag gaatagcagg 
ggccgggctc ggtcagtctc cagctgggat 
agacagactg cctgactgtg tgtcatcagg 
ctcacctata aaatggaact aataacgatg 
tagctgaaga gaactaatat aacatgaaag 
gcagattcca attttattcg agggcagcgt 
gtcagggggc ctggttctga gtcctagttc 
agccactgga cctctctgtt cctctgcaaa 
ccaggatgca gtaggtccca gggactatcc 
cagcgggaac cctgttccgg ggctcacatt 
acagaataag tgcataacta catggtacag 
atgagagagt tgagagtgcc cggtgtggtg 
gggaaacctc agggaaggtg aggactatgg 
gccctgcaaa tggcaggcac cacaggagca 
cccgggagaa gtagggagac atagaagggg 
aggagaaggc ggaggggcac tcagggcgag 
tactcccagg acgcggctgc gtgcagacct 



3 



ctgccctctg 


cacctccggc ctgcatgtcc 


2640 


gctgggtcag 


aaggcctggg tggttggcct 


2700 


cgtttctggg 


aaccttggcc ccgactcctg 


2760 


gcaccggctc 


actctgttca gcagtgaaac 


2820 


aggtcccagc 


gtgagtgtcg cttctggcat 


2880 


aagtgatgta 


accctcctct ccagcctgtg 


2940 


acccctcagc 


tataaatagg gcatcgtgac 


3000 


gggtactaca 


gcagaaggta agccgggggc 


3060 


tcagatgtaa 


ctgagctgtg ggctaggagg 


3120 


tgaaaaacct 


ctcccctctc ataagaocag 


3180 


tgcccacctg 


gggtcttaca ggaggagctg 


3240 


ggtcttcctg 


ttccgctgag aaaagttagg 


3300 


aacagagtca 


tcatggccaa gagtccaccg 


33*0 


gcttgtgcag 


gaattggggc tggagggaag 


3420 


ccccagagtg 


gtcaccctac ccctccctcg 


3480 


ctggtcaccg 


tctccctgaa cctcgatttg 


3540 


cctgggctcc 


ctgtctcagg ggctctggta 


3*00 


tgctttctaa 


gctttgggat aagctaaaag 


3660 


agattggtgc 


ttcagctcgt ggatgacaga 


3720 


tgtctcttcc 


cagctgtgtg acgttgaaca 


3780 


acagcatgaa 


ccaattcatt aactacttct 


3840 


taggaatgtg 


ggctgtatta gtaaacacaa 


3900 


cacatcagag 


caaacaga-ca aagacgctgg 


3960 


agggttataa 


ggagggaaaa ggggagctgg 


4020 


gggaaagctg 


cagggtgaaa tactgcatca 


4080 


tgaggtcaga 


ggggttgata tgagaacagt 


4140 


tgagccgtca 


tcttcacctt tagcattcag 


4200 


caggtgctgg 


ccaagaggca ggggcaggag 


42*0 


ggtgtcaggc 


ccgccacccc agagcaccat 


4320 


ggaaccagcc 


tagggagcag ccgcagatca 


4380 
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caactgagaa caaacgacag tctctgcctc aaaaatggcc catggaattg cgtctctgga 4440 
gacgctgcct gagcaggagc agcacagtga gcgggctgca tcgaccagcg ccatccaaac 4500 
cccgaacagt tggcgcttgt caggcaggac ttcccagcag tcggttccca caggtttccc 4560 
ctgttgacct gatttgatgt gactgtctag attaggtgtg aactggtggc ttaggcttct 4620 
ctgcacagaa aggcctgcaa gcagcagaga gagttttctg ttccattttt ccatgtcatg 4 680 
tggctcttcc tgagaacagc ggatggagtc aaatgcatgg ggagtggggt gagatggtag 4 740 
ctgaggtcag aatttggcat ttgaatgact gaagcagaac aaaacacacc aggtacttca 4800 
gcagctgcac cgtgttgagg gcaggtgctg gttacgggtc tgggtgaggg aagccagctg 4860 
ccaatgtaag aagaatgact gggtatgctt agatgaagca gaaaaatcta ggcatcaagg 4920 
tggccttgag tcagtgatga cacgctacag ctccaaggaa gcctggccta gccctggggg 4980 
gacagaaaag gccaagaagt gacgatattg cagtacaccc ccctccacaa gaaatgagtg 5040 
agatgtggta caaaatgtta gaattgaatg aatcaataga ataaacgttc atcccttcaa 5100 
tcaagaagag tcagatgaaa tgaattagca gggccagccc aagaacctct tctgggggtc 5160 
tcagggtagc tttcatttgt agcagctgag gcggaaccag ctgtaaggcc tttgagagaa 5220 
cgtggtgctg gacccgtgtc tagggcaggg gttctaaacc ctgcttacat atcagagtca 5280 
cctgagaatt ttctattttt tttttttt 5308 

<210> 2 
<211> 1159 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (151) . . (1009) 
<223> exon 2 of AGT gene 

<400> 2 

aatccatttc ttcttttttt tgtagacatg aatggattta ttctgggcta aatggtgaca 60 
gggaatattg agacaatgaa agatctggtt agatggcact taaaggtcag ttaataacca 120 
cctttcaccc tttgcaaaat gatatttcag ggtatgcgga agcgagcacc ccagtctgag 180 
atggctcctg ccggtgtgag cctgagggcc accatcctct gcctcctggc ctgggctggc 240 
ctggctgcag gtgaccgggt gtacatacac cccttccacc tcgtcatcca caatgagagt 300 
acctgtgagc agctggcaaa ggccaatgcc gggaagccca aagaccccac cttcatacct 360 
gctccaattc aggccaagac atcccctgtg gatgaaaagg ccctacagga ccagctggtg 420 
ctagtcgctg caaaacttga caccgaagac aagttgaagg ccgcaatggt cgggatgctg 480 
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gccaacttct tgggcttccg tatatatggc atgcacagtg agctatgggg cgtggtccat 540 
ggggccaccg tcctctcccc aacggctgtc tttggcaccc tggcctctct ctatctggga 600 
gccttggacc acacagctga caggctacag gcaatcctgg gtgttccttg gaaggacaag 660 
aactgcacct cccggctgga tgcgcacaag gtcctgtctg ccctgcaggc tgracagggc 720 
ctgctagtgg cccagggcag ggctgatagc caggcccagc tgctgctgtc cacggtggtg 780 
ggcgtgttca cagccccagg cctgcacctg aagcagccgt ttgtgcaggg cctggctctc 840 
tatacccctg tggtcctccc acgctctctg gacttcacag aactggatgt tgctgctgag 900 
aagattgaca ggttcatgca ggctgtgaca ggatggaaga ctggctgctc cctgacggga 960 
gccagtgtgg acagcaccct ggctttcaac acctacgtcc acttccaagg taaggcaaac 1020 
ctctctgctg gctctggccc taggacttag tatccaatgt gtagctgaga tcagccagtc 1080 
aggccttgga gatgggcagg gggcagccct gcggacatac ctggtgacca cccttgagaa 1140 
gtggggaagt ggcctctcc 1159 

<210> 3 

<211> 568 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (151) . . (418) 
<223> exon 3 of AGT gene 

<400> 3 

ccaggagaat gcagagtggc ctgggggggg ccaaggaagg aggctgaggc agggcgaggg 60 
gcaggatctg ggcctttggt gtctgccagc cctcattcct gcccctgtct tgggtgactc 120 
ttccctccct gtctcctgtc tggatttcag ggaagatgaa gggcttctcc ctgctggccg 180 
agccccagga gttctgggtg gacaacagca cctcagtgtc tgttcccatg ctctctggca 240 
tgggcacctt ccagcactgg agtgacatcc aggacaactt ctcggtgact caagtgccct 300 
tcactgagag cgcctgcctg ctgctgatcc agcctcacta tgcctctgac otggacaagg 360 
tggagggtct cactttccag caaaactccc tcaactggat gaagaaactg tctccccggt 420 
aggagcctcc cggtctcccc tggaatgtgg gagccacact gtcctgccca ggctgggggc 480 
ggggtgggga gtagacacac ctgagctgag ccttgggtgc agagcagggc agggccgcgg 540 
tggcacgggg ctgggcaggc ggcctgtg 5 * 8 



<210> 4 
<211> 445 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (151) . . (295) 
<223> exon 4 of AGT gene 

<400> 4 

cccaaggggg aggatgactt agttgggtga tggggggagc aaacatccct gtcgtcaggg 60 
ttgggtgcaa ggagcataag cctgcctggc ctctgggaga gccctcactg tgtggcctgg 120 
agccttccta actgtgcatc atctccccag gaccatccac ctgaccatgc cccaactggt 180 
gctgcaagga tcttatgacc tgcaggacct gctcgcccag gctgagctgc ccgccattct 240 
gcacaccgag ctgaacctgc aaaaattgag caatgaccgc atcagggtgg gggaggtatg 300 
tgtgagcctg tgtctgtgcc tgacctgggt tccaagtgtg cacagggtgg gaggcatgga 360 
tgtaagggac acagaggagg ctatgggtgg ggccagcagg gcaagaggga gcggagagta 420 
gggccaaagg tgggagagaa gtagc 445 



<210> 5 
<211> 1091 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (151).. (941) 
<223> exon 5 of AGT gene 

<400> 5 



actgtgcatc 


ctcctggcag 


gctggaaaat 


gtccccctcc aggacagtgc 


acagcacaga 


60 


ggtcctgagc 


ccaccccggc 


cctctagccc 


tcagcaccct gggtcaccca 


gtgcgccctc 


120 


agaatgatcc 


tgatgtctgc 


tgctttgcag 


gtgctgaaca gcattttttt 


tgagcttgaa 


180 


gcggatgaga 


gagagcccac 


agagtctacc 


caacagctta acaagcctga 


ggtcttggag 


240 


gtgaccctga 


accgcccatt 


cctgtttgct 


gtgtatgatc aaagcgccac 


tgccctgcac 


300 


ttcctgggcc 


gcgtggccaa 


cccgctgagc 


acagcatgag gccagggccc 


cagaacacag 


360 


tgcctggcaa 


ggcctctgcc 


cctggccttt 


gaggcaaagg ccagcagcag 


ataacaaccc 


420 


cggacaaatc 


agcgatgtgt 


cacccccagt 


ctcccacctt ttcttctaat 


gagtcgactt 


480 


tgagctggaa 


agcagccgtt 


tctccttggt 


ctaagtgtgc tgcatggagt 


gagcagtaga 


540 


agcctgcagc 


ggcacaaatg 


cacctcccag 


tttgctgggt ttattttaga gaatgggggt 600 


ggggaggcaa 


gaaccagtgt 


ttagcgcggg 


actactgttc caaaaagaat 


tccaaccgac 


660 


cagcttgttt 


gtgaaacaaa 


aaagtgttcc 


cttttcaagt tgagaacaaa 


aattgggttt 


720 
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taaaattaaa gtatacattt ttgcattgcc ttcggtttgt atttagtgtc ttgaatgtaa 780 
gaacatgacc tccgtgtagt gtctgtaata ccttagtttt ttccacagat gcttgtgatt 840 
tttgaacaat acgtgaaaga tgcaagcacc tgaatttctg tttgaatgcg gaaccatagc 900 
tggttatttc tcccttgtgt tagtaataaa cgtcttgcca caataagcct ccaaaaattt 960 
tatctttcat ttagcagcca aacagatgta tacaattcag cagatagact gtgcaaacga 1020 
aagtgctttc ctggactttg gatggaattt ccatgggagg tctgagccag tacttagcag 1-080 
tcctttgaag t ^091 



<210> 6 

<211> 2099 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (40) . . (1494) 

<220> 

<221> sig_j>eptide 
<222> (40) . . (138) 

<400> 6 

aagaagctgc cgttgttctg ggtactacag cagaagggt atg egg aag cga gca 54 

Met Arg Lys Arg Ala 
1 5 

ccc cag tct gag atg get cct gec ggt gtg age ctg agg gee ace ate 102 
Pro Gin Ser Glu Met Ala Pro Ala Gly Val Ser Leu Arg Ala Thr He 
10 15 20 



etc tgc etc ctg gec tgg get ggc etg get gca ggt gac egg gtg tac 
Leu Cys Leu Leu Ala Trp Ala Gly Leu Ala Ala Gly Asp Arg Val Tyr 
25 30 35 



ctg gca aag gee aat gee ggg aag cce aaa gac ccc ace ttc ata cct 
Leu Ala Lys Ala Asn Ala Gly Lys Pro Lys Asp Pro Thr Phe He Pro 
55 60 65 

get cca att cag gee aag aca tec cct gtg gat gaa aag gee eta cag 
Ala Pro He Gin Ala Lys Thr Ser Pro Val Asp Giu Lys Ala Leu Gin 
70 75 80 85 



aag gee gca atg gtc ggg atg ctg gec aac ttc ttg ggc ttc cgt ata 
Lys Ala Ala Met Val Gly Met Leu Ala Asn Phe Leu Gly Phe Arg He 
105 HO H5 



150 



ata cac ccc ttc cac etc gtc ate cac aat gag agt acc tgt gag cag 198 
He His Pro Phe His Leu Val He His Asn Glu Ser Thr Cys Glu Gin 
40 45 50 



246 



294 



gac cag ctg gtg eta gtc get gca aaa ctt gac acc gaa gac aag ttg 342 
Asp Gin Leu Val Leu Val Ala Ala Lys Leu Asp Thr Glu- Asp Lys Leu 
90 95 100 



390 
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tat ggc atg cac agt gag eta tgg ggc gtg gtc cat ggg gec acc gtc 438 

Tyr Gly Met His Ser Glu Leu Trp Gly Val Val His Gly Ala Thr Val 

120 125 130 

etc tec cca acg get gtc ttt ggc acc ctg gee tct etc tat ctg gga 486 

Leu Ser Pro Thr Ala Val Phe Gly Thr Leu Ala Ser Leu Tyr Leu Gly 

135 140 145 

gee ttg gac cac aca get gac agg eta cag gca ate ctg ggt gtt cct 534 

Ala Leu Asp His Thr Ala Asp Arg Leu Gin Ala lie Leu Gly Val Pro 

150 155 160 165 

tgg aag gac aag aac tgc acc tec egg ctg gat gcg cac aag gtc ctg 582 

Trp Lys Asp Lys Asn Cys Thr Ser Arg Leu Asp Ala His Lys Val Leu 

170 175 180 

tct gee ctg cag get gta cag ggc ctg eta gtg gec cag ggc agg get 630 

Ser Ala Leu Gin Ala Val Gin Gly Leu Leu Val Ala Gin Gly Arg Ala 

185 190 195 

gat age cag gee cag ctg ctg ctg tec acg gtg gtg ggc gtg ttc aca 678 

Asp Ser Gin Ala Gin Leu Leu Leu Ser Thr Val Val Gly Val Phe Thr 

200 205 210 

gec cca ggc ctg cac ctg aag cag ccg ttt gtg cag ggc ctg get etc 726 

Ala Pro Gly Leu His Leu Lys Gin Pro Phe Val Gin Gly Leu Ala Leu 

215 ~ 220 225 

tat acc cct gtg gtc etc cca cgc tct ctg gac ttc aca gaa ctg gat 774 

Tyr Thr Pro Val Val Leu Pro Arg Ser Leu Asp Phe Thr Glu Leu Asp 

230 235 240 245 

gtt get get gag aag att gac agg ttc atg cag get gtg aca gga tgg 822 

Val Ala Ala Glu Lys He Asp Arg Phe Met Gin Ala Val Thr Gly Trp 

250 255 260 

aag act ggc tgc tec ctg acg gga gee agt gtg gac age acc ctg get 870 

Lys Thr Gly Cys Ser Leu Thr Gly Ala Ser Val Asp Ser Thr Leu Ala 

265 270 275 

ttc aac acc tac gtc cac ttc caa ggg aag atg aag ggc ttc tec ctg 918 

Phe Asn Thr Tyr Val His Phe Gin Gly Lys Met Lys Gly Phe Ser Leu 

280 285 290 

ctg gee gag ccc cag gag ttc tgg gtg gac aac age acc tea gtg tct 966 

Leu Ala Glu Pro Gin Glu Phe Trp Val Asp Asn Ser Thr Ser Val Ser 

295 300 305 

gtt ccc atg etc tct ggc atg ggc acc ttc cag cac tgg agt gac ate 1014 

Val Pro Met Leu Ser Gly Met <Sly Thr Phe Gin His Trp Ser Asp He 

310 315 320 325 

cag gac aac ttc teg gtg act caa gtg ccc ttc act gag age gec tgc 1062 

Gin Asp Asn Phe Ser Val Thr Gin Val Pro Phe Thr Glu Ser Ala Cys 

330 335 340 

ctg ctg ctg ate cag cct cac tat gee tct gac ctg gac aag gtg gag 1110 

Leu Leu Leu He <31n Pro His Tyr Ala Ser Asp Leu Asp Lys Val Glu 

345 350 355 
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ggt etc act ttc cag caa aac tec etc aae tgg atg aag aaa ctg tet 1158 
Gly Leu Thr Phe Gin Gin Asn Ser Leu Asn Trp Met Lys Lys Leu Ser 
360 365 370 

ccc egg acc ate cac etg acc atg ecc eaa ctg gtg etg caa gga tct 1206 
Pro Arg Thr He His Leu Thr Met Pro Gin Leu Val Leu Gin Gly Ser 
375 380 385 

tat gac ctg cag gac ctg etc gec eag get gag ctg ecc gec att ctg 1254 
Tyr Asp Leu Gin Asp Leu Leu Ala Gin Ala Glu Leu Pro Ala He Leu 
390 395 400 405 

cac acc gag ctg aac ctg eaa aaa ttg age aat gac egc ate agg gtg 1302 
His Thr Glu Leu Asn Leu Gin Lys Leu Ser Asn Asp Arg He Arg Val 
410 415 420 

ggg gag gtg ctg aac age att ttt ttt gag ett gaa geg gat gag aga 1350 
Gly Glu Val Leu Asn Ser He Phe Phe Glu Leu Glu Ala Asp Glu Arg 
425 430 435 

gag ccc aca gag tct acc caa cag ett aac aag cet gag gtc ttg gag 1398 
Glu Pro Thr Glu Ser Thr Gin Gin Leu Asn Lys Pro <31u Val Leu Glu 
440 445 450 



gtg acc ctg aac cgc cca ttc etg ttt get gtg tat gat caa age gec 
Val Thr Leu Asn Arg Pro Phe Leu Phe Ala Val Tyr Asp Gin Ser Ala 
455 460 465 



1446 



act gec ctg cac ttc ctg ggc cgc gtg gec aac ccg ctg age aca gca 14 94 
Thr Ala Leu His Phe Leu Gly Arg Val Ala Asn Pro Leu Ser Thr Ala 
470 475 480 485 



tgaggecagg 


gccccagaac 


acagtgcctg 


gcaaggectc tgeecetggc 


ctttgaggca 


1554 


aaggecagea 


gcagataaca 


accccggaea 


aateagegat gtgtcaeccc 


cagtetecea 


1614 


ccttttcttc 


taatgagtcg 


actttgagct 


ggaaagcagc egtttctcct 


tgg tet aag t 


1674 


gtgctgcatg 


gagtgagcag 


tagaagectg 


cageggcaca aatgeaccte 


ecagtttget 


1734 


gggtttattt 


tagagaatgg 


gggtggggag 


gcaagaacca gtgtttagcg 


egggaetact 


1794 


gttccaaaaa 


gaattccaac 


cgaccagctt 


gtttgtgaaa caaaaaagtg 


ttcccttttc 


1854 


aagttgagaa 


caaaaattgg 


gttttaaaat 


taaagtatac atttttgeat 


tgcctteggt 


1914 


ttgtatttag 


tgtcttgaat 


gtaagaacat 


gacctccgtg tagtgtctgt 


aataccttag 


1974 


ttttttccac 


agatgettgt 


gatttttgaa 


eaatacgtga aagatgcaag 


eacctgaatt 


2034 


tctgtttgaa 


tgeggaacca 


tagctggtta 


tttctccctt gtgttagtaa 


taaaegtett 


2094 


gecac 










2099 


<210> 7 
<211> 485 
<212> PRT 
<213> Homo 


sapiens 
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<400> 7 

Met Arg Lys Arg Ala Pro Gin Ser Glu Met Ala Pro Ala Gly Val Ser 
15 10 15 

Leu Arg Ala Thr lie Leu Cys Leu Leu Ala Trp Ala Gly Leu Ala Ala 
20 25 30 

Gly Asp Arg Val Tyr He His Pro Phe His Leu Val lie His Asn Glu 
35 40 45 

Ser Thr Cys Glu Gin Leu Ala Lys Ala Asn Ala Gly Lys Pro Lys Asp 
50 55 60 

Pro Thr Phe He Pro Ala Pro He Gin Ala Lys Thr Ser Pro Val Asp 
65 70 75 80 

Glu Lys Ala Leu Gin Asp Gin Leu Val Leu Val Ala Ala Lys Leu Asp 
85 90 95 

Thr Glu Asp Lys Leu Lys Ala Ala Met Val Gly Met Leu Ala Asn Phe 
100 105 110 

Leu Gly Phe Arg He Tyr Gly Met His Ser Glu Leu Trp Gly Val Val 
115 120 125 

His Gly Ala Thr Val Leu Ser Pro Thr Ala Val Phe Gly Thr Leu Ala 
130 135 140 

Ser Leu Tyr. Leu Gly Ala Leu Asp His Thr Ala Asp Arg Leu Gin Ala 
145 150 155 160 

He Leu Gly Val Pro Trp Lys Asp Lys Asn Cys Thr Ser Arg Leu Asp 
165 170 175 

Ala His Lys Val Leu Ser Ala Leu Gin Ala Val Gin Gly Leu Leu Val 
180 185 190 

Ala Gin Gly Arg Ala Asp Ser Gin Ala Gin Leu Leu Leu Ser Thr Val 
195 * 200 205 

Val Gly Val Phe Thr Ala Pro Gly Leu His Leu Lys Gin Pro Phe Val 
210 215 220 

Gin Gly Leu Ala Leu Tyr Thr Pro Val Val Leu Pro Arg Ser Leu Asp 
225 230 235 240 

Phe Thr Glu Leu Asp Val Ala Ala Glu Lys He Asp Arg Phe Met Gin 
245 250 255 

Ala Val Thr Gly Trp Lys Thr Gly Cys Ser Leu Thr Gly Ala Ser Val 
260 ^ 265 270 

Asp Ser Thr Leu Ala Phe Asn Thr Tyr Val His Phe Gin Gly Lys Met 
275 280 285 

Lys Gly Phe Ser Leu Leu Ala Glu Pro Gin Glu Phe Trp Val Asp Asn 
290 295 300 



Ser Thr Ser Val Ser Val Pro Met Leu Ser Gly Met Gly Thr Phe Gin 
305 310 315 320 
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His Trp Ser Asp He Gin Asp Asn Phe Ser Val Thr Gin Val Pro Phe 
325 330 335 

Thr Glu Ser Ala Cys Leu Leu Leu He Gin Pro His Tyr Ala Ser Asp 
340 345 350 

Leu Asp Lys Val Glu Gly Leu Thr Phe Gin Gin Asn Ser Leu Asn Trp 
355 360 365 

Met Lys Lys Leu Ser Pro Arg Thr He His Leu Thr Met Pro Gin Leu 
370 375 380 

Val Leu Gin Gly Ser Tyr Asp Leu Gin Asp Leu Leu Ala Gin Ala Glu 
385 390 395 400 

Leu Pro Ala He Leu His Thr Glu Leu Asn Leu Gin Lys Leu Ser Asn 
405 410 415 

Asp Arg He Arg Val Gly Glu Val Leu Asn Ser He Phe Phe <Slu Leu 
420 425 430 

Glu Ala Asp Glu Arg Glu Pro Thr Glu Ser Thr Gin Gin Leu Asn Lys 
435 440 445 

Pro Glu Val Leu Glu Val Thr Leu Asn Arg Pro Phe Leu Phe Ala Val 
450 455 460 

Tyr Asp Gin Ser Ala Thr Ala Leu His Phe Leu Gly Arg Val Ala Asn 
465 470 475 480 

Pro Leu Ser Thr Ala 
485 



<210> 8 

<211> 681 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (151) . . (531) 
<223> altenate exon 1 

<400> 8 

agggcgggga aggggggagg 
ggatttgacc tgcaattgaa 
catcctccca cagaggcact 
aggggaggct ggcacattct 
tcagaccatg gctctctgtg 
cgctcatgga gggtgactga 
ttacggctcg cattctccag 



of AGT gene 

aagggggact gagactctcc 
cctttcctca aacacaaggc 
gtgaatgtat aaagaaaggc 
gctccttagc taaccgcaca 
gtctatcttt gccattactt 
agatgccacc ccgaggtgca 
ccggaccagg cagttgagga 



tgtttctata tgctatgatg 60 
acaaacctgc cccagcccat 120 
gatgctaaga aacgtgctgc 180 
ggcccatggg tccacacaca 240 
taaccaggct gtttctcatg 300 
ccctcagcct cgctggacac 360 
agtcatcgaa gtccagctgg 420 
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agctcctcat ccgcatacct gagcacaatc agctgcagga ggtggctgct cagctgaaag 480 
cctgtgggga caaaggagga gacccaggac tcgctgtatc cgagccccga ggtgagtctg 540 
cacctttgag ggagctccca ggcagctgag gagagggctc agaagaggaa gccactgttt 600 
cactcctcaa agctggggtc cccagaccca ■ gtgggactgc tgcccttggt gtgaatgcag 660 
cgtgagagtg tgaatctgtg g 681 



<210> 9 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :M13 forward 
primer 

<400> 9 

gttttcccag tcacgacg 18 



<210> 10 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :M13 reverse 
primer 

<400> 10 

aggaaacagc tatgaccat 19 



<210> 11 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 11 

cctcgaagtg gcctatggaa 20 



<210> 12 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 12 

gcgagactct atgggtgatg 20 



<210> 13 

<211> 20 

<212> DNA 

<213> Homo sapiens 
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<400> 13 

gactgagact ctcctgtttc 



20 



<210> 14 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 14 

cggggtggca tcttcagtc 19 



<210> 15 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 15 

tggcacattc tgctccttag 20 



<210> 16 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 16 

atacagcgag tcctgggtct 20 



<210> 17 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 17 

gaccaggcag ttgaggaagt 20 



<210> 18 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 18 

gattcacact ctcacgctgc 20 



<210> 19 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 19 

ccactttcca gcatgttacc 20 



<210> 20 
<211> 20 
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<212> DNA 

<213> Homo sapiens 
<400> 20 

ctgctattcc tcctgatggc 



<210> 21 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 21 

acaggcccat gtgagatatg 



<210> 22 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 22 

tggtattagg gcactgaagg 



20 



20 



<210> 23 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 23 

tgtgttgaag caggatcttc 



<210> 24 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 24 

gatgcaggca ttgaaagatg 



<210> 25 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 25 

gtgtttaaca gtctccccag c 



<210> 26 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 26 

ctaggtgtgt gacagcctga 



20 
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<210> 27 
<211> 19 
<212> DNA 
<213> Homo 

<400> 27 
catgtccctg 



sapiens 
tggcctctt 



<210> 28 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 28 

tgaggggtgg ggatggag 



<210> 29 
<211> 20 
<212> DNA 

<213> Homo sapiens 



<400> 29 

ctggtcatgt gaaacttacc 



18 



20 



<210> 30 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 30 

accttcagct gctccaaaga 



<210> 31 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 31 

ccactttcca gcatgttacc 



<210> 32 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 32 

ctgctattcc tcctgatggc 



<210> 33 

<211> 18 

<212> DNA 

<213> Homo sapiens 



WO 00/71751 



PCT/US00/13327 



16 

<400> 33 

aaccctcctc tccagcct 18 



<210> 34 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 34 

tcatcaccgt gcctcctc 18 



<210> 35 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 35 

tacttggact ttgggctgag 20 



<210> 36 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 36 

ttgagcaagg cactttgttt c 21 



<210> 37 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 37 

ggctaaatgg tgacagggaa t 21 



<210> 38 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 38 

tagggccttt tcatccacag 20 



<210> 39 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 39 

aatgccggga agcccaaaga 20 



<210> 40 
<211> 20 
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<212> DNA 

<213> Homo sapiens 



<400> 40 

tccaaggctc ccagatagag 



20 



<210> 41 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 41 

ctctccccaa cggctgtctt 20 



<210> 42 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<210> 43 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 43 

ctctatctgg gagccttgga 20 



<210> 44 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<210> 45 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 45 

tgaagcagcc gtttgtgcag 20 



<210> 46 

<211> 19 

<212> DNA 

<213> Homo sapiens 



<400> 42 

ctgcacaaac ggctgcttca 



20 



<400> 44 

cacagcctgc atgaacctgt 



20 



<400> 46 

tccccacttc tcaagggtg 



19 
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<210> 47 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 47 

acaggttcat gcaggctgtg 20 



<210> 48 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 48 

tccccacttc tcaagggtg 19 



<210> 49 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 49 

tgcctcagtg aactcaaatg g 21 



<210> 50 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 50 

ctccccctcc aaaactacca 20 



<210> 51 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 51 

caggagaatg cagagtggc 19 



<210> 52 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 52 

gcaggcgctc tcagtgaag 19 



<210> 53 

<211> 20 

<212> DNA 

<213> Homo sapiens 
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<400> 53 

agcactggag tgacatccag 



<210> 54 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<4O0> 54 

gatggaggac tggtagacag 



<210> 55 
<211> 20 
<212> DNA 
<213> Homo 

<400> 55 
aagggggagg 



sapiens 
atgacttagt 



<210> 56 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 56 

gaaattccat ccaaagtcca g 



<210> 57 
<211> 20 
<212> DNA 
<213> Homo 

<400> 57 
gtgcaaggag 



sapiens 
cataagcctg 



<210> 58 
<211> 19 
<212> DNA 
<213> Homo 

<40O> 58 
ctactctccg 



sapiens 
ctccctctt 



<210> 59 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 59 

aagggggagg atgacttagt 



<210> 60 
<211> 21 
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<212> DNA 

<213> Homo sapiens 
<400> 60 

gaaattccat ccaaagtcca g 
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<210> 61 
<211> 17 
<212> DNA 

<213> Homo sapiens 
<400> 61 

tagccctcag caccctg 



17 



<210> 62 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 62 

tgtccggggt tgttatctg 
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<210> 63 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 63 

agaacacagt gcctggcaag 
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<210> 64 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 64 

ttcacaaaca agctggtcgg 



20 



<210> 65 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 65 

agtgtttagc gcgggactac 



20 



<210> 66 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 66 

tttggaggct tattgtggca 
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<210> 67 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 67 

tgaaagatgc aagcacctga 



<210> 68 
<211> 20 
<212> DNA 

<213> Homo sapiens 



<400> 68 

gaaagcactt tcgtttgcac 
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SEQJbHC£ LISTING 

<lin> Mt-Grail, Maura 

Kussell, Dean no 1*. 
Shi>l , i.*.J!ik / Doruja M. 
>lY>iAd Genetics, Inc. 

-r.nO> DIABETES CENS 

<130> Diabetes G&ne 111 

<140> 
<141> 

<iD0> US 60/174 , 7IJ0 

<151> 2300-01-06 

<1S>0> US 63/ 135, < 23 
<151> 1999-05-21 

<lhll> 69 

cl70> Patanbln Vis*. 2.0 

<210> 1 
<211> 530H 

<2tl3> homo sapiens 

<220> 

<221> ml AO faatuxe 
<2M> [3012) 3047] 
<223> &xon 1 of ACT gene 

<4 0Q> 1 

cjr.se cageca cttgacccaa cctgciag&lC 
ttaagggtrt-. ttqqijcl.cjl.gi: rraaagtggct 
gcaaagcaat tgtttaaagt acagctCU.t 
tctatateq* <Mft»qLa«tt!i gtctgatctc 
notgectgae taatacatca tgctct-rjcLii 
attctgauti) Mlqqt*l.<:Lg r.tctcacgct 
tttataaaga aaaagaggtt tcu^gactt 
aobix^l.ggtg gaaggtgaag gaggagcaoc? 
gagaaagcat gttcagggga ufilvccccttt 
c&i'-taf.caca agaacagcac cagaa^Ai.c: 
qtccctcccg tgaoacatgg qAattatggg 
a?ci;t»*Af.&a tatcagatgg cttattt.<Jtft- 
cagtcttcat ttagccacag at*Acrntctg 



Acaaaacccc gtgttgtata aaagtaarjqt 60 
ggggcaaeqi: Lftftaaattaa caacatctca L20 
r.tcaaaatgg agtctcttat gtstfceCCU 1&0 
tctttttttb ftfcctacatat cccaatagca 240 
actccatatg tggctggttt cct.qqatcny S00 
get tteH;tiyoq ac;ataccaga gactgggtaa 360 
atgtgttcca cgtgtjctgyi) qa^gectcac 420 
wgi^catctt acatggtggc tggco*wi« 4 HO 
ataaaaewt wyAtctcat gagacttatg 5« 
caccctcatg attcMttac ul.cccactgg 600 
agctaoaatl qgagatttgg gtggggacac 660 
ttctatgtag aacctctqc*, 1 1: teat tea a 720 
tccctaactt: ccq^tqatgg aa tgtacaca 780 
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iraagaaactt ccactgatgg aatgaaccics 
gaatctq^nc" tqiml-iu»AAl- gtc-tgcagtc 
tttacattct caataaaatt ctctoq^aoU 
r.TtlafcQl.aga agat^cccca gctgatagtc 
caatcttgca acaagtt-aa <)u»i)Ar.att > .ff 
ILitr.tcstgtt ctccccagta acgtgtcatt 
aaatttnrjcuj gijtustqH/iH Ltfctatttte 
tcagttgttt attatctttg ctgtt ippl-rj 
ijrj<;<:i*aggca r^cggaaac tctcccaaag 
tgctgagcnc caggaata<it ijxvln*At.qA 
tatttaaaca ctgggcacaa ggttgttgct 
gtgtccaggt cgacttgttu rjLtqcagttaa 
gl.ggcacagc caggcctagg accccagtct 
ggagcacttt gantggtctc ttattttaaa 
gagacagagt cttactctgt agcccaggct 
laacctccat <5tGCtgg<rr:t caaaaagtgt 
gcccgaatgt tctgttcaga coaivjvwcx: 
at.f.tttgagg Aqtcr.otatc tataggaaca 
acaggcccat gtgagatatg atttttttaa 
gcjtatttgli^ ti)l.lli|fctq* litgttcagtc 
ggcaccatct cttcctgctc tttgttnU.a 
gccatcgtrgg atatgecgtg gctccttgaa 
'ctgtcccttc agtgccctan tt"j:<:*t:gtat 
tgcctcaccc actgcgtcac ttgtgatcac 
gtcatgccct gttataacga ctitcatiMC|C: 
aaccctttta etggtcatgt gaaacttacc 
cagcacatct ttcaaxgcct gccsLt^tgaa 
gtgctattgt tggtgtttaa cagtetccoc 
ttetettgct tqtgtrrtttt. rrcccagtgtc 
ttggagcagc tgaaggtcac acatcccalq 



2 

r).9Mi}Ql^ttl. ar;r.gggiiaga aaactggcet B*0 
agtttgaatg gctgctcctt atgggaatna 900 
l.l.ietq-attq attntaatga gctttaaagc 960 
agccttgggc *itggatr««jn cjij)i;t<;t.t.#tfc 1020 
tttattgggt ccaatctaac caaaattatt 1080 
at t sen]* gas ritttqqBtliJ c:tUmAgyiWS llflfl 
ttttacacca ctTtccagca tgttacctga 1200 
qagi.gaU:**. trcaagggcc cgaggcagga 1260 
accaggatct ttgtttitgtt ccctgacata 1320 
aacaaattgt gaggctttaa agagccgaaa 13B0 
taatcagtgo tsnjatuctt « ocleeccotl- mo 
accacttgcrt gatcctcaaa caactagtta 1500 
ctactgttcc anctaacccc* ttcqcwrowi 1560 
aaaattaaat taaaattgtc tatttattta 1620 
Kgeutqctfql. qgLgiuiAteA tagntcactg 16R0 
ttgaattaca gatgcgaggc act fftaectq 1^40 
t^taagtngc: tgtggggccc cagacaagtg 1800 
aagtaattaa aaaaatgtiat tt^geiattt 
atqaOqatU fi^qtAAzqq gtaaaaaaga 1920 
agtgaatgta cagcttctgc ctcatatccu I960 
aatgttccat tcctgggtaa ittcatgtct 2040 
-cctgcttrjtg ttq««qc»M Atettcette 2lOO 
ttaaggctgg acacatcacc actcccaacc 2160 
tggcttctgg cgntClft** cutHqqtjdtct 2220 
aagtcrttacc tataggaaaa taagaattat 2280 
atttgcaatt tquicngcAl: aaacacagaa 2240 
ggeattttgt ttgtgtcttt cnatctg^ut 2400 
agctaeactg gaaacttcca gaaggcactt 2460 
tattagaggc ctttqOKttft ggtsggctet 2520 
agngggcagcr agggtcagaa giggcccocg 2580 



WOM/7t75l 

tgttgcctaa gcsagactct cccctgccct 
ctgtggcctc c 7.gggcjtjl;af; aictcccggg 
caggctgtca cscacctagg gagatgctcc 
caaacttcgg ^aaat.gtg-a actcgaccct 
tc-gcatcga Tcactaagoc ttcct.iw*»cj 
ctgtccttct ggccagcctg rggtctggcc 
<:ac;ag kI.i}L)<]m/»;:m<| ftLccatCCCC 

ccggccaggg gaagaagctg ccgttqtxr.-t 
cccctcagct ccttctcggc cttgtctctc 

H^eitjquncq <H9<igft<19C* cqqlgAtQAn 

tcatccggac gcgggctttc ccccactcgg 
ct.cc-t <\ot>#tv<M<?i s - Aagatggtca 

gtcctcagga acggagcaga ctggtacagg 
ggtc^tcUi) K»tc99g*ijj <5<*al".agcagg 
ggccgggctc ggtC2gtctc cagctgggat 
agacagantg i*f:l.i\£cU)U) l;gtnatcagg 
ctcacctata aaatggaact aataacgatg 
tagctgaaga gAarH.aa I. at aacatgaaag 
gcagattcca attttattcg agggcarjcqt 
gtcagggggc ctggttctga gtcctagttc 
a<ar;cc»ctggn octet ctgtt cct.cl.i^ A »» 
ecaggatgea gtaggtccca gggactatcc 
cagr.qqcj^ac cctgttccgg ijijel:cacatt 
acagaataag tgcataacta catggtacag 
al;qattaq3ijt tgagagtijcc ixggtgtggtg 
gggaaacctc agggaaggtg aggactatgg 
cjccwtgciiaa tggcaggcac v.™*aggagca 
cccgggagaa gtagggagac atagaagggg 
aggagaaggc ggaggggcac t.C*gggcgag 
tactcccagg acgcggctgc gtgeagaect 
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ctgccctctg 


eacctccggc ctgcatgtcc 


26<0 


gctgggtcag 


aaggcctggg tggitggcct 


2700 


cgtt tct rjyq 


cut t ggee ccg act cct g 


2760 


gca ccggctc 


acictgitca gcagtgaaac 


2B20 




tit ii tit »i t c<i nt t ct a neat 


2880 


aagcgatgta 


accct octet ccagcctgtg 


2940 


acccctcagc 


taraaatagg geategtgae 


3(1 DO 




<2 t)C9# ii y h u 0 <) r^r-<3 i|r j<]<) o 


3060 


t ca gat. gta a 




3120 


l:cfr*<4 Aaacr: 1. 


c l. cicc ft I. c: 1 . i . a I*, a ag a acjt rj 


3D HO 


4- o r^r^fn prf n 

% w^Wtl WW v y 


n rs t « h ^ ji cl;* nnrinn^nrt n 




not Ct fc CCt Of 


tt cca ct oa<3 aaa aat t aci</ 


3300 


Uul| a y La 






n ct 1* a toe a ci 




3d20 


w w^uo y bcvj t. y 


•y l k« w w low www l w ww wvy 




ct gg t LdC eg 






w w l y sj l ww 


nt et e aci n on ct ct not! a 


3600 


t a ctt t a a 




3660 


m fT/i tt rjirt* err! 




3720 


t Ct ct ett cc 


caactotatnr acattaaaca 


3780 


hi i: #<j r.a 1 .g 9 « 


lU^AALtfidl.l M^LacttcU 


3B40 


Caaaaatata 


ggctgtatta gtaaacacaa 


3900 


cacatcagag 


craaacagaca aa gaegctgg 


3960 


□gggttataa 


QsaciCQaaaa qqtrciagctgq 


4020 


gggaaagcrtg 


cagggtgaaa tact gca tea 




tgaggtcnga 


ggggxtgata t^nt^y^qt 


4140 


tgagccgti:a 


tcttcacctt tagcattcag 


4200 


caggtgcrtgg 


ccaiigciggca giji^casgaq 


42W 


ggtgtcaggc 


ccgccacccc agagcaccat 


4320 


gganccagcc 


tag<jq?»9c*iii Cw^c/igatca 


4300 
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caactgagaa ccLoacgacng fci;l.<:vi^:ccc 
gacgcU^m: gagcaggagc agcacagtga 
cccgaacagt iggcgcttgt caqqc«Kiqrt<: 
ctgtttj^uftl. i}At.r.tgaT.gt gactgtctag 
ctgcacagaa agn r-ct.qci>*i i*< gaga 
tggctcttcc tgagaacagc ggntggcigtu 
ctgag^tccq »nl.i.l.<jyi:M. 1:tgaatgact 
gcagctgcac cgtgttgagg gcnygtgclM 
wcK?ti*l-A A 9 ftA<^aat.gACT. gggtatgctt 
tggccttgag rcagtgatga cucqutovwij 
gacagaaaag gccaagaagt gacgotattr? 
agati*tqijl.A cadmi1.i)I,U gaattgaatg 
tcaagaagag tcagatgaaa tgaattctqrc* 
ttagqgtago tttcattt.gt agcagctgag 
cgtggtgctg gacccgtgtc tagggce"39q 
or.u.qe«cfdAlC i:tfti:*r.r.m. tttttttt 

<210> 2 
<2ll> 1159 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> nisc_featuxe 
<2Z2> 1151) . , (1009) 
<223> *xcm 2 i>l' AGT qfcfJ* 

<400> 2 

*atccaU:1-<r: I.Letl.tttl-1 tgtagacatg 
gggaatattg agacaatgaa agatctggll 
cettfe*aecc tttgcuiAAftt gatatttcag 
atggctcctg ccggtgtgag cctg*M<J l -<: 
Ctggctg^aq. gtrga^sggr. gtacatacac 
acctgtgagc agctggcaaa ggcu*6t.gc:<r: 
<jCtC<i£«tt^ *ijqr.r;?>*qtf«: atcccctgtg 
ctagtcgctg caaaactTga caccgMtyM 



4 

aaaaatggcc catggaattg cijtjctctg^ 444 0 
gcgggctgcs li^n»^cg ccatecaaac 4500 
l:lxcti*;agcag tcggttccca caggtttccc 4560 
attaggtgtg a<»c^i)qti*i)<: LCaggcttet 4 620 
gagttttctg ttccattttt ccntgtcatg 4 680 
pa* Iqcatqq ggagf.ggggt gagatggtag 4 740 
gaagcagaac aa a acacacc i>qryttfot.tt-t« 4 HOC 
qvL^gggtc tgggtgaggg aagccagctg 4B<S0 
agatgaagca gaaaaiitct* qij^AL^AAgg 4 £20 
rHecaaggaa gcctggcc-ia gcccxggggg #980 
o*cib»cnKtxi i^i^cLccacaa gaaatgagtg !>O40 
aatcaataga ataaacgttc ptcttttlw!* 
ijqqtfcAgiu^: aagaacctct tctgggggtc 5160 
gcggaaccag ctgt iiag<jix- U.fc^agagaa 522 1> 
L)hl,&hA6iitt: ct.gcttatat atoagagtca 52Bt> 

5508 



aatggattta txctgggcte .™i<3<fl-gaca 60 

ac^t.ggca^r. taaaggtcag ttaataacca 120 

gqtatgcgga agcgaiTcvjut: ccagtctgag 180 

accatcctct gcctcctggc ctgggctggc 240 

cccrttccncc t«qlcaue« caargagagt 300 

gggaagccca aagaccccac ct teat acr*t 360 

gatgaanngq uwiACAgga ^cagctggtg 420 

aagttgaagg ccgcaatggt cgggcrtgCtg 4130 
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pccupctl.ftl. i.^oc:l.r.ccg tatatatggc 

ggggccaccg tcctctccr-c i«wt:cj^i*:^tc 

gocttggacc scacagctg« ccigqc:.t.c?u«<j 

a o c t g ca cx: t i;r:« j q c I . ij 94 tgcg caca ag 

ctgctagtgg cceagggcag 

g gcg 1 9 X U-fl 1 -A q r.r*: rragq cct gca cctg 

tataccccJrg tggtcctccc ctdir.:U:iAl.y 

i*.7i4&l.!:gaca ggttcatgca ggctgtgacn 

gccagtq^cjq wyir^iMftOi^L ggctttcaac 

c.r.ctctgctg gctctggccc taggacttwrj 

*nj}qrj^Ll.ft<ja gatgggcagg gggcagccct 

gtggggaagt ggcctctcc 

<2l0> 3 
<211> 568 
<212> J>K>A 

<2l3? Homo sapiens 
<220> 

<22 1 > m L L« * In ri* 
<222> [151) . . (41B) 
<223> exon 3 of AGT gene 

<JIU» 

i:<\f*ggagaal: gfcagagt.gge ctgggggggg 
gcaggatctg ggcctttggt gtfrl.gctcagc 
ttccetccct gtctcctgtc tggal-ttoacj 
a gec v-tiy i; $1.1.0.1 m I g gaca a cag ca 
tgggcacctt. ccagcactgg agtqa^Atfcc 
tcactgagatf cquctqwtq tftgetgatcc 
tggagggtct cactttccag caawoUf^ 
ag gamete*: tfiMkutcciift tggaatgtgg 
ggggtgggga gtagacacac ot^dgcrtgag 
tggcac^cjg cx^qciAqqc. ggcctgtg 



5 

atgcacngtg agctatgggg cgtijqtc.Cv?*: 5<0 
tttggcacce tggectctct ctatctggga 600 
i*<:M»l.r.£l.gij gl.gll. ncl.t.g gaaggacaag 660 
gtcctgtctg ccctgcaggc tgiacagggc 720 
caggcccagc tgctgctgtc cacggtggzg 780 
aa gca geegt ttgtgcaggg wtggcbcU: H4U 
g&i*;l;ti;aeag aactggatgt tgetgetgag 300 
ggatijganr^ ul.<jqi.rUii;U: <:i^r.qafcggg«i 36Q 
acctacgtcc acttccaagg taagrjeaaac 1020 
\#l,i;C4«U)l. C)1.aqi^l:<|ftga r.cagccagtc 1080 
geggacatoo ct<v.itgaccci vtr.<X\.<iht**A 1140 

1159 



uf.o^ncjawijq a^ol.tjaggc agggegaggg 60 

cctcattcct gcccctgtct tgggtgactc 12D 

cjg£agAt.gaa gggcttct.ee crgcxggccg 180 

cctcagtgtc trjttt;cc*tq ^atatqqoA 240 

aggacaaett cteggtgact caagtgccct 30 0 

agcctcactn tgoetctnsK! utg<J ac * a 6S 

tcaactggat gaagaaactg tctccccggt 420 

gagccacact qUtczLqcxx* ggctgggggc 480 

ccttgggtgc agagcaijggu nqq^gi^gg 5*0 

569 



<210> 4 
<211> 445 



WOW71751 



FovusM/iasr? 



<212> DKA 

<213> Hrtmi> .s«pi.rtJlH 

<220> 

<221> mice fwtWTO 
<27.2> (Ihl) . . <24h) 
<223> ex on d cf AUT gene 

<40D> 4 

cr;caaggggg aggatgactt ag^tgggtga 
1 1 qq n bq » * i) I . « ag Gft t. g c-f; tg gc 
agccttccta acigtgcatc atctccccaij 
q<;U}<:ftagga tcttatgacc tgeaggaect 
gcacaccgag crgoacctgc aiioaiittnuq 
L.qLqagcm.g tut ^tgtgcf; tgacctgggt 
T^taoqqq^- vi^qoqi^qi) iil.A'..gggtgg 
gggecaaagg tgggagagaa gtagc 

<210> 5 

<211> 1091 

<212> DNA 

<213> Btnra> .rof > i.cn* 

<220* 

<221> misc_feature 
<223> e*on 5 of ACT g&ne 
<4IH» ft 

actgtgcatc ctcctggcag gctggaaaat 
ggtcctgagc ccaccccggc octet oqci-<: 
agaatgatcc tgatgtctgc tgctttgcag 
geggatgaga gagagcccnc artvqlctaxc 
gtgaccetga accgcccatt cctgtttgct 
ttcctgggcc gcgtggccac* i&c get gage 
tgcctggeaa ggccs.ct.gcc cctggccttt 
oggacaaatc agcijiitgtgt Cftfrcc ccaq 1 
r.gagctggaa ageagcegtt tctccttggt 
agcctgcagc ggcacaaatg cctcd'.cftcag 
o,gggaggr;aa gAa^r:agtgt ttagcgcggg 
cagcttgttt gtgaaacaaa t?aagtgttcc 



6 



tggggggagc aaacatccct gtcgtcaggg 60 

ctctgggaga gccctf:ar.tg tgtggcctgq 12I> 

QttOCtttCGiU? no Otfcat qc UCOC^&tqqt 100 

getcgcccag getgagctge ccgccattct 24 0 

r.:.7aU;at;ui)<: aL£:j»ijqqV.qg yQgtiqg 3 1 JO 

tccaagtgtg cacagggrgg gaggcatgga 360 

ggecagcagg gcaagaggga gcggagagta 420 

4 45 



gtocccctcc aggacagtgc aonqr^wijo §0 

tftAQ£A&nr.t gggtcaccca gtgcgccctc 120 

gtgetgaocn. genttttttt tqtqctfcfpiJ .1*0 

caa^agctta acaagectga ggtcttggag 240 

gtgtatgatc anagcgccac tqccct-qcac 300 

acagcatgag gccagggccc cagaacacag 360 

yssjrjcassss usaq&ag^ag ataacaacce 42U 

ctcccacctt ttcttctaat gag^cgnctt 480 

ctnagtgtr^: t^CAtcjo/jgt gagcagtaga 54 0 

tttgctgggt ttattttaga gaatgggggt €00 

aotactgttc w***&Q*at. tccaaccgac 660 

cttttcaagt tgagaacaaa actttqqUttt 720 
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uifstuibU*** q'-*l.»CigiU.*~ I.L.gijAttgce ttcggtttgr atttagtgtc ttgaatgtaa 780 

gaacatgacc tccgtgtagi gtctgtaata ccttagtttt ttccneagat rictttitgait 840 

UXiiAfiCtrit ai:gr.cjAaAga tgtaagcacc tgaatxtctg tttgaatgcg gaaccatagc &00 

tggttatttc icccttgtgt tngtautaap cqtcttgcw i^wl.*.*qwl. t 9G*0 

tatctTtcat ttagcagcea aacagatgta tacaattcag cagatagact gtgcaaacga 1020 

aagtgctttc ctggacttig gatgq*u»ttt cutft'.qi?q*i?q U:U?«qi;i;«<j L»<:l # tt»q&tiq 1080 

tcctttgaag t 1091 



<210> £ 

<211> 2099 

<ZVt> DNA 

<2i3> homo sapiens 

<221> CDS 

<222> f^lOl - . (Id 94) 

<220> 

<221> Riq_p«pr.irfe 
<222> [4D>- . (138) 

<flD0> 6 

aagaagctgc cgr.cgr.r.i^Lg ggtactacag cagaagggt atg egg aag cga gca 54 

Met Arq Lys Arq Ala 
1 5 

ccc cag tct gag M.g get cct gec ggt gtg age ctg agg gec acc ate 102 
Pro Gin Sex Glu Met Ala L'ro Ala Gly Val Ser Leu Arg Ala Thr He 
10 1!» 20 

etc tgc etc ctg qrzcz egg get ggc ctg get gca ggt gae egg gxg tac 150 
Leu Cys Leu Leu Ala 'J*rp Ala Gly Leu Ala Ala Cly Asp Arg Vnl Tyr 
25 30 3* 

ata cac ccc ttc cac etc gtc ate cac aat gag agt acc tgt gag cng 198 
1 1&. Hi* Pro Phe J3is leu V*l Tie Hi. -a IKAn PI u TJir CyA Glu Girt 

40 45 50 

ctg gca aag gee aat gec ggg waQ kjou aw fcU: <M.a CCl 

h&M Aid l.y* Als> Ayn AIa Gly Ly* Pro LyB Asp Fro Thr Phe lie Pro 
55 60 65 

get cca att cag gec aag aca tec cct gtg gat gna <jec eta i>wg 294 
Ala Pro lie Gin Ala Lys Thr Ser Pro Val Asp Glu Lys Ala Leu Gin 
70 7b B0 85 

gac cag ctg gtg eta gtc get r.hk cpii: aitr: <j*a gac aag ttg 34 2 

Gin Leu Val Leu Val Ala Ala Lys Leu Aep Thr Glu Asp .Lys leu 
90 95 100 

aag gee gca atg gtc ggg atq er.g tjec aae ttf: ttg ggc ttc cgt ata 390 
Lys. A3 it Ala Met Vul Gly M*t Leu Ala As* k>he .Leu Gly I'he Arg He 
10* 110 115 
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tat cjci i- Hlq <:*k age gag eta tgg gge gtg gtc cat ggg gec acc gl.t: 4liH 

Tyr Gly Met his Sex Glu Leu Trp Gly Val Val His Gly Ala Thr Val 
120 3 i;in 

etc tec oca acg get gtc ttt gge acc ctg gec tct etc tat ctg gga 4 8b 

Leu Scr Pru Thr Al^j V* I Pht* <Uy Thr 1,<-u Al* ftar l.wu Tyr Les» <5J y 
rjh HO US 

gee ttg gac cac aca get gnc ngg eta cag gca ate. etg ggr gtt cct 534 

Ala Leu Asp His Tur A".* Atifi Arq Leu Gin Ma 7I<; Lou <3J y Val Pro 

150 15S 160 165 

tgg aag gac aag aac tgc aec tec egg ctg gat gcg cac aag gtc ctg 582 

Trp Lys Asp Lys A.\;n Cy* Thr Sr./* Arq T.«u h*\> A 1 1» fliy T.y*; V#l Leu 
170 175 1B0 



tct gec etg cag get gta cuq gqs etq eta qtrj gee cag qge am get 630 

5*r Ala Lftu Gin Aid Val Gin Gly Leu Leu Val Ala Gin Gly Arg Aid 

IBS 190 195 

qat age *^**q ij<:i: <:^lj »-l.g '-'-5 i-l:g fcert Ai-u gl.i> gl.g ggi- gLo LC.c aoi 6*7{J 

Asp Ser Gin Ala Gin Leu Leu Leu Set Thr Val Val Gly Val The Thr 

200 205 210 

tjrxi era <:Lq owe aag cag ccg ttt gcg cag gge ctg get ctr. 726 

Ala Pro Gly Leu hie Leu Lys Gin Pro Phe Val Gin Gly Leu Ala Leu 

215 220 225 

t.Al a*:<* cel. gvg gtc cm cgc tct ctg gac ttc aca gaa ctg gar. 774 

Tyr Thr rro Val val Leu Pre Axg Sex Leu Asp Phe Thr Glu Leu Asp 

230 235 240 £4 5 

gtt get get gag a*g att gac egg ttc atg cag get gtg aca gga tgg 822 

Val Ala Ala Glu Lys He Asp Arg Phe Met Gin Ala Val Thr Cly Trp 

250 35* 260 

aag act gge tgc tec ctg acg gga gec agt gtg gac age ace ctg gci 870 

Lys Thr Gly Cys Ser Leu Thr Cly Ala Ser Val Asp Scr Thr Leu Ala 

2 65 270 275 

ttc aac acc tac gtc cac ttc caa ggg aag atg aag gge ttc tec ctg 51 B 

Phe Asn Thr Tyr Val Bis Phe Cln Cly Lys Met Lys Cly Phe Sor Leu 

28U 2BS 230 

etg gec gag ccc eag gag ttc tgg gtg gac aac age acc tea gtg tct 966 

Leu Ala Glu Pro Gin Clu Plic Trp V&l A*p Av« Ssr Thr Sfcr Val Scr 

29b 300 205 

gtt ccc atg etc tct gge atg gge acc ttc eag cue tgg agt que «te 1014 

Val Pro Met Leu Ser Gly Met Gly Thr Ph«j Bin Kia Trp Ser Asp He 

33 Q .31* 320 325 

cag gac aac ttc teg gtg act cao gtg ccc ttc act tjuq aqr: gec l:gi: 10S2 

Gin Asp Asn Phe Ser Val Tbr Gift VaI Pro Phe Thr Glu Ser Ala Cys 

330 335 340 

ctg ctg ctg ate cag cct cac tat <*cc tct rjw i;i.g g*c: **g gtg gag 11*0 

Leu Leu Leu lie Gin Pro Pi* Tyr Ala Ser Asp Leu Asp Lys Val Glu 

345 350 355 
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ggt etc act ttc cag caa aac tec etc aac tgg atg aag aaa ctg tci 1158 
Gly Leu Thr Phe Gla Gin Asn Ser Lou Asn Trp Mot Lys Lys Leu Ser 

sso 365 :r/n 

ccc egg acc ate eac ctg acc atg ccc caa ctg gtg ctg caa gga tc- 1206 
Pro Arg ?hr lie His Leu Thr Mut Pro Cln l<r.u Val Leu Cln CLy Ser 

27.* :*an Mh 

tat gac ctg cag gac ctg etc qcc c*»n qnt nciq ctq coc qca att cig 1254 
Tyr Asp Len Gl.'j h&p Leu Leu Ala Gin Ala Glu Leu rxo Ala He Leu 
3D0 " 395 400 4 03 

c»C a<r:ti i^fj < % .l.q hai; ftl.q aaa ttg age aa-t. g*r cgc «1.ft A<jg gtt> 130?. 
Jlia Thx Glu lieu A*n Leu Gin Lys i,eu Ser Aan Asp Axg lie Arg Val 
410 415 420 

qqq rj^n c]l.i) i;.lq OQU oqu Wl Ltl. CjAtJ fit I. g*a QCfj 0ft C <3*i* £14* 1350 

Gly Glu Val Leu Asn iZer lie Phe L'he Glu Leu Glu Ala ABp Glu Arg 
425 430 435 

gag one; aca gag i.r.l. caa cag ctt aac aag cct gag gtc ttg gag LftM 
Glu l*ro Thr Glu Ser Thr Gin Gin Leu Asn Lys Pxo Glu Val Leu Glu 
440 445 450 

gtg acc ctg aac cqc cca ttc ctg ttt get gtg tat gat caa age gee 104 6 
Val Thr leu Asn Arq Pro Phe Leu Phe Aln Val Tyr Asp Cln Sor Ala 
455 4 60 4 fro 

act gee ctg ea£ LLC r.tg ggc cgc gtg gec aac ccg ctg age aca gca 1434 
Thr Ala Leu Wis L»he Leu Gly Arg Val Ala Asn Pro Leu Ser Thr Ala 
470 475 480 465 

tgaggecagg gccccagaae acagtgcctg gcaaggcctc tgcccctggc ctttgaggca 1554 

aaggecagea gr.ai}.<n*A<-a ar.fsccggaca aatcagegat gtgtcacccc cagtetccca 1614 

ccttttcttc taatgagtcg actttgagct qciaaagcagc cgtttctcct tggtctaagr 167 4 

gtgctgcatg gagt.qtu^riq, t.agaagcctg cagcggcaca aatgcacctc ecagtttget 1734 

gggtttattt tagagaatgg gggtggggag gcaogaacc* qtqfcktaqtyi oqqqvctaet 17 94 

gttccaaaaa g a ai.t ogaccagctt gtttgtgaaa caaaaaagtg txcccttttc 1854 

aagttgagaa caaaaattgg gttttaowivit tti.^enqtot^c vtbfc'-t^cvt bqcvttogqt 1914 

ttgtatttag tgt.nl hqftnL gt.aagaacat gacctccgtg tagtgtctgt aataccttag 1974 

ttttttccac agatgettgt gatttttgcia coat^cqtq* w»qaLd^»« rt»ect^«rtt 2034 

tctgtttgaa tgcggMC^» tagctggtta tttctccctt gtgttagtaa taaaegtett 2034 

gecac '-2099 

<210> 7 

<211> 4B5 

<212> PRT 

<213> Hoomo sapiens 



WOW/71751 
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w 

oi00> 1 

Met Ar<j Lyft Arg Ala l»ro Gin Ser Clu Met Al* Ma Gly Val Bex 
1 5 50 IS 

Leu Art* Ala Thr lie i,eu Cys Leu Leu Aln Trp Aia Giy L*u Ala Ala 
20 30 

Gly Asp ^ r< 3 v< * 1 3le 9is A ' xo Phe His ieu Vnl I3c H '* (51lJ 

35 40 45 

Ser Thr Cys Civ Gin l.eu Ala Lys Ala Asn Ala Gly Lys Pro Lys Asp 
511 55 €0 

Fro Thr Phe II© Pro Alt* Pn> II* Gin Ala Lys Thr Sex Pxo Val A3p 
65 ™ "» 

Glu Lys Alo Urn G1» A«p Girj Leu Val Leu Val Ala Ala Lys Leu A.?t> 
85 90 *i> 

Thr Glu Asp Lys Leu lys ma tiet Val Gly fc*et Leu Ala Asn Phe 
HH> 105 3 30 

l.eu Gly The Axg He Tyr Cly Met Hi* 5*r Glu Leu Txp Gly Val Val 
115 1^0 125 

Bis Gly Ala Thr Val Leu Ser Wo Thr Ala Val -Phe Gly Thr Leu Ala 
130 135 140 

Ser J>ci> Tyr leu Gly Ala Leu Asp His Ttax Ala Asp Arg Gin Ala 
IdS L50 155 160 

lie Leu Gly V«L Pn> Trp Lys Asp Lya Asn Cys Thx Ser Aril Leu A$p 
165 170 1?S 

Ala His Lys Val Leu Ser Alo Ijmi Gin Ala Val Gin Gly Leu Lbu Val 
1P0 1B5 190 

Ala Gin Gly Arg Ala Asp Sex Cln Al» C,\n l*» Leu Leu Ser Thr Val 
195 200 205 

Val Gly Val Phe Thr Ala Pro Cly hw K:i* T«u Lya Gin Pro Phe Val 
210 21b 220 

Gin Gly Leu Ala Leu Tyx Thr Pro VaJ Val l.fcu Pro Arg Ser Leu Aiap 
225 230 235 240 

Phe Thr Gin Leu Asp Val Ala Ala Glu Lys lie Adp Arg Phe Met Gin 
245 ?-50 255 

Ala Val Thx Gly Trp Lys Thr Gly Cys Sex Leu Thr Gly Ala Ser Val 
260 265 270 

Asp Sex Thr Leu Ala Phe Asn Thr Tyr Val Ms The Gin Gly Lys Met 
275 280 285 

l.y» Gly Phe Her Leu Leu Ala Civ Pru GJ n Glu Phe Trp Val Aap Asn 
290 295 300 

Ser Thr Sur VaL 5«r Val Pro Met Leu Scr Gly Mel Gly Thr Phe Gin 
305 320 
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Trp Uar Asp lie Gin Asp Asn Pho Scr Val Tlir Gin V w I Pm Phe 
325 :*3H 33* 

Thr Glu Ser Ala Cys Leu Leu Lou lie Cln Pzu H*i* Tyr A 1.9 S«r A?;p 

Lev A*i> T -y rt V *J ^1'^ Klv Leu Thr Phe Gin Gin Asn Sor Leu Asn Trp 
3D£> 360 36b 

Met Lya J.y* 1«» &*r Pro Axg Thr lie his Leu Thx Met Pro Gin Leu 
370 375 330 

Val Leu Cln Gly Scr Tyr nap Leu GIij Asp Leu Leu Ala Gin Ala Glu 
385 390 395 400 

Leu Pro Ala lie Leu His Thr Glu Leu Aim Leu Gin Lys Leu ser Asn 
4I>* 410 415 

Asp Axg lie Arg Val Gly Clu V*J 1.«u ^jh $*tr II a The Glu Leu 

420 420 4 30 

Glu Ala Asp Glu Arg Clu Pro Thr RJ.u S*r Thr Gin Gin Leu Asn Lys 
435 440 445 

Pro Glu Val Leu Glu Val Thr Leu Acn Arg Pru Ph<s l«u Wi* Ala Val 
450 455> 460 

Tyr Gl ■» Rfcr Ala Thr Ala Leu J3i3 Phe Leu Cly Arq V«il Al» Ayn 
4 410 475 a BO 

Pxo Leo 5i*r Thr Ala 
485 



<210> 6 
<211> 6B1 
<212> DNA 

<213> Hdwo ^piorMs 
<22D> 

<221> mis cofeature 
<222> (151)., (531) 
<223> a Iterate exon 1 of AGT gene 

<400> 5 

egggegggga aggggggagg aagggggnct gcigactct.ee tQMU:tfil:A t.gctatgatg 60 

gg&tttgacc tgutittttga* cctttcctca aacacaaggc acaaacctgc ccctKZcO&*t'* 120 

catcctccica cagaggcact gtgaafcgtat aaagsaaqqi; tyt (.gat flag* aacgtgctgc 180 

aggggaggct ggcacattct gctccitage taaccgcaca ggcccatggg tccacacaca 240 

tcagaccatg gctctct<jfrj gtctatcttt gecattaett taar.:carjw:l'. gfcttctcatg 200 

cgctcatgga gggtgactga agat peaces ecrgaggtgea ccctcagcct cgctggacac 360 

ttncqqctw cwt-tctcipig oeggaccagg cngttrioqqa *gU;M:i;gaa gtccagctgg 420 
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a get cc teat I gagcracaatc agetgeagga ggtggctgct cagctgaaag 480 

lUM.Qt.gggga caaaggagga gacccaggc^;. U*|c?LQLaLc: ogagocr;cga gqtgagtctg 5>40 

ccacDtttij^ij qaACfctccca ggcagctgag gagagggcrc nqcwjinjq^w qucc»i;Uil;tl 

cactcctcan. ai3<."ci<iiwl- i: tfiifcagaccea, gtgggactgc tgccrttggt gtgantgcag 6<30 
<:ql.Qrigagtg tgaatctgtg g €61 



<21(J> 9 
<211> IB 
<212> DNA 

<213> Art.il IftiAi Sequence 
<220> 

<223> Description of Artificial Sequence :ti!3 forward 
primer 

«00> 9 



<210> 10 

<hii> is 

<212> DMA 

<213> Artificial Sequence 
<22D> 

<223> Description of ArU I'lfcial Sequence :Hl 3 reverse 
pr i infer 

<40O> 10 

aggaaacagc tatqacnert 19 



<210> 11 
<211> 7.0 
<212> DNA 

<213> Homo sapiens 
<oon> 11 

cctcgaagtg gcctatggaa 21) 



<210> 12 

•C211> 20 

<212> DKA 

<223> Homo sapiens 

*400> 12 

gegagactet atgggtgatg 20 



<210> 13 

<211> 20 

<2l2> DNA 

<213> Homo sapiens 



WO W?17SJ 
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<40il> 13 



<210> 14 
<211> 19 
*.2:t2> CNA 

<213> homo Eapiens 
<400> Id 

uggggtggca tcttcagtc 19 



<210> 15 
<211> 20 
<212> nNA 

<213> Bono sapiens 
<400> 1* 

tciTCtfi.-Al.te: T.g^tccttag 20 

<210> 16 
<2U> 20 
<212> UNA 

<213> Homo trap 1 arm 
<a£>0> 16 

atacagcgarr tcctgggtck 20 



<210> 17 
<21L> 20 

<213> flcmo sapiens 
<40U> IV 

cjAccaggcag ttgaggaagt 2D 

<%10> 16 

<211> 20 

<212> DNA 

<2D3> KOirtf> Jul pi mm 

<400> IB 

gattcacact ctcacgotgc 20 



<210> 13 
<211> 20 

<2l3> >iamo sapiens 
«flW» 19 



<21Q> 20 
<21J.> 2D 
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<2L2> DNA. 

<21J> Htfiim .sapiens 

f.«O0> 20 

ctsctattt:n tcfttgatggc 



<:210> 21 

<:213> 20 

<2l2> UNA 

<213> Homo s cvpi on $ 

<4D0> 21 

agg ccc at g t g a ga t at g 



<2U1> 22 
<211> 20 
<212> CNA 

c213> Homo sapiens 



2D 



20 



<400> 22 

tggtattagg gcactgaagg ^° 



<210> S3 
<211> 20 
<212> DtfA 

<213> Jicono sapiens, 
<d00> 23 

tgtgttgaag ©aggatcttc 20 

<21D> 24 

<213> 20 

<212> DMA 

•C213> Homo sapiens 

<:400> 24 

O10> 25 

<21L> 21 

<212> IWA 

<213> Homo ftapleft* 

<4O0> 25 

gtgtttaaca gtotcc^cj^ C 2i 



<210> 26 

<211> 20 

<212> DNA 

<213> Bono aaplena 

<4O0> 26 
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<210> 21 

<m> is 

<2I2> UNA 

<213> Bomu Art pi tns 
<40D> 27 

catgtccctg t<m<;i:l.<:Lr. 1^ 



<210> 28 
<211> 10 

<213> Homo sapiens 
<400> 

l:gAggggtgg ggatggag 18 



<21H> 2$ 
<2il> 20 
<:212> DNA 

<213> Homo *»fiifcrlft 
<400> 29 

ctggtcatqt ijaAfti:ttacc 20 

•c210> 30 
<2H> 20 
<212> DMA 

<213> Honio #*pi«n.»; 
<*DD> 30 

accttcagct gctccaaaga 20 



<210> 31 
<211> 20 
<212> DNA 

<213> Hocno sapiens 
<:«0> 31 

cC£Ctt.lr.ca gcatgttacc 20 



C210> 32 
<211> 20 
<212> DMA 

<213> Homo sapiens 
<4 00> 32 

ctgctattcc tcctgatggc 2tf 



<21fl> 33 

<211> IB 

<212> D»A 

<213> Hcpcp wi>5ft/ift 
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<4DQ> 33 

aaccctcctc tccsigcct 



<<na> 34 

<21i* L8 
•C212> DNA 

<400> 34 

tr.-^Uv9i:i':^:. gcctcctc 



<210> 35 

c2ll> 20 

<212> DNA 

<213> Borno yo]>ivn4 

<4 00> 35 

tacttggact ttggcictcpn 20 



<210> 36 

<211> 21 

<212> DWk 

<213> Homo sapiens 

<400> 35 



<onn> :i? 

<211> 21 

<212> DNA 

<233> Hi^iwj sapi&ns 

<4 00> 37 

ggctaaatgg tg<ica<ig*?cu» I. 



<210> 3B 
<211> 20 
<212> 3>W 
<213> Homo 

<4 0Q> 30 
lagggccttit 



sapiens 
tc at ccac a g 



<?L0> 39 
<2ll> 20 
<212> DNA 
<213> Homo 

<:4 0O> 33 
aatgccggga 



sapiens 
agcccaaaga 



<210> 40 
<211> 20 
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<212> DHA 

•c213> Bcmo wplcns 

iccaaggctc ccaqntcKiaci 2ti 



c2i0> 41 
<2D1> »0 
<2T2> DNfl 

<213> homo sapiens 
<400> 41 

c:i:r.f.c^rrr:aa cggctgtctt. 20 



*.2l0> 42 
<211> 20 

<400> 42 



<210> 43 

<2ll> 20 

<212> DNA 

<213> Homo sapiens 

<40Q> 43 

ctctatctgg gagccttgga 20 



<21€> 44 
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